A Modern Treatment of the 15 Puzzle


Aaron F. Archer 


1. INTRODUCTION. In the 1870’s the impish puzzlemaker Sam Loyd caused 
quite a stir in the United States, Britain, and Europe with his now-famous 
15-puzzle. In its original form, the puzzle consists of fifteen square blocks num- 
bered 1 through 15 but otherwise identical and a square tray large enough to 
accommodate 16 blocks. The 15 blocks are placed in the tray as shown in Figure 1, 
with the lower right corner left empty. A legal move consists of sliding a block 
adjacent to the empty space into the empty space. Thus, from the starting 
placement, block 12 or 15 may be slid into the empty space. The object of the 
puzzle is to use a sequence of legal moves to switch the positions of blocks 14 and 
15 while returning all other blocks to their original positions. 


Figure 1. The starting position for the 15-puzzle. The shaded square is left empty. 


Loyd writes of how he “drove the entire world crazy,” and that “A prize of 
$1,000, offered for the first correct solution to the problem, has never been 
claimed, although there are thousands of persons who say they performed the 
required feat.” He continues, 


People became infatuated with the puzzle and ludicrous tales are told of 
shopkeepers who neglected to open their stores; of a distinguished clergyman 
who stood under a street lamp all through a wintry night trying to recall the 
way he had performed the feat.... Pilots are said to have wrecked their 
ships, and engineers rush their trains past stations. A famous Baltimore 
editor tells how he went for his noon lunch and was discovered by his frantic 
staff long past midnight pushing little pieces of pie around on a plate! [9] 


The reason for this hysteria, of course, is that Loyd’s puzzle has no solution. Each 
move causes a transposition of the 16 blocks (where the empty square is consid- 
ered to contain a blank block), and for the blank to end up in the lower right 
corner requires an even number of moves, so the resulting permutation is even.
But the desired end placement is an odd permutation of the original, and is hence 
unobtainable. One must assume Sam Loyd knew this, and from there one can 
only conjecture how much amusement he derived from driving the American 
public insane. 
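The parity argument above can be checked mechanically. The following is a minimal sketch, assuming a placement is stored as a tuple of 16 entries read row by row, with 0 standing for the blank; this representation and the names `permutation_parity`, `start`, and `target` are illustrative choices, not taken from the article.

```python
def permutation_parity(perm):
    """Return 0 for an even permutation of 0..n-1 and 1 for an odd one.

    The parity is read off the cycle decomposition: a cycle of length k
    is a product of k - 1 transpositions.
    """
    seen = [False] * len(perm)
    transpositions = 0
    for first in range(len(perm)):
        length = 0
        j = first
        while not seen[j]:
            seen[j] = True
            j = perm[j]
            length += 1
        if length:
            transpositions += length - 1
    return transpositions % 2


# Figure 1 read row by row; 0 is the blank in the lower right corner.
start = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 0)
# Loyd's target: blocks 14 and 15 exchanged, everything else unchanged.
target = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 14, 0)

# Permutation of the 16 blocks (blank included) carrying start to target.
block_map = [0] * 16
for cell in range(16):
    block_map[start[cell]] = target[cell]

loyd_parity = permutation_parity(block_map)  # 1 means odd
```

Because the blank starts and ends in the same corner, any solution would use an even number of moves and so produce an even permutation, whereas `loyd_parity` records that the target is odd.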

The puzzle has inspired a sizable number of articles and references in the 
mathematical literature. The first of these is a pair of articles published in the
American Journal of Mathematics in 1879 by W. W. Johnson [7] and W. E. Story 
[13]. Johnson’s article is an explanation of why odd permutations of the puzzle are 
impossible to obtain, while Story’s article proves that all even permutations are 
possible. The editors were apparently so apprehensive and defensive about pub- 
lishing articles on what some might charge to be a frivolous topic that they 
attached the following justification to the end of Story’s article: 


The “15” puzzle for the last few weeks has been prominently before the 
American public, and may safely be said to have engaged the attention of 
nine out of ten persons of both sexes and of all ages and conditions of the 
community. But this would not have weighed with the editors to induce them 
to insert articles upon such a subject in the American Journal of Mathemat- 
ics, but for the fact that the principle of the game has its root in what all 
mathematicians of the present day are aware constitutes the most subtle and 
characteristic conception of modern algebra, viz: the law of dichotomy 
applicable to the separation of the terms of every complete system of 
permutations into two natural and indefeasible groups, a law of the inner 
world of thought, which may be said to prefigure the polar relation of left 
and right-handed screws, or of objects in space and their reflexions in a 
mirror. Accordingly the editors have thought that they would be doing no 
disservice to their science, but rather promoting its interests by exhibiting this 
a priori polar law under a concrete form, through the medium of a game 
which has taken so strong a hold upon the thought of the country that it may 
almost be said to have risen to the importance of a national institution. 
Whoever has made himself master of it may fairly be said to have taken his 
first lesson in the theory of determinants. [13, p. 404] 


The puzzle is a popular topic for books on recreational mathematics or mathemati- 
cal potpourri, such as [1], [2], [4], [5], [9], and [12], most of which use it as an 
example to illustrate the consequences of even and odd permutations, as does [14]. 
Various sources have suggested variants of the 15-puzzle, including [3], [4], [6], [8], 
[10], and [15]. Today the puzzle appears on some computer screen savers, and a 
version is distributed with every Macintosh computer. 

Most references to the 15-puzzle explain the impossibility of obtaining odd 
permutations and many state Story’s result that every even permutation is indeed 
possible, but this author found only three proofs. R. M. Wilson [15] published a 
more general result in 1974, which we discuss at the end of this article. Ball and 
Coxeter’s book [1] refers to [10] for a proof, but the article does not fulfill the 
promise. The arcane terminology of Story’s article [13] renders it difficult to wade 
through, and of course it does not take advantage of modern notation developed 
since then. Spitznagel [11] published a proof in 1967, but later wrote that “Over 
the years there have been published a number of unnecessarily complicated 
explanations of the puzzle. I confess that I myself once published one of these 
overly complicated accounts” [12]. Indeed, Herstein and Kaplansky [5] write that 
“No really easy proof seems to be known.” This article intends to rectify
that deficiency. 


2. SOLUTION. It should be noted that the proof provided here was developed 
independently of the previous proofs, but coincidentally shares some ideas with 
Story’s proof [13]. 

We call each of the 15 pieces blocks, and the 16 different squares on the board 
we call cells. For reasons that soon become apparent, we number the cells in the 
snakelike pattern shown in Figure 2. We can think of the empty cell as being 
occupied by a blank block. Each legal move then consists of “moving the blank,” 
that is, exchanging the blank block with one of its horizontal or vertical neighbors. 
A placement is a bijection from the set of blocks (including the blank) to the set of 
cells—in other words, a snapshot of the board between moves. Given an initial 
placement, we wish to determine what other placements are attainable through a 
sequence of legal moves. 


Figure 2. The dashed line and the numbers in the corner of each cell indicate a special ordering of the 
cells that we use to define equivalence classes of placements. 


Notice that by moving the blank block along the snaking path of Figure 2 we can 
move the blank to any cell without changing the order of the remaining blocks 
along this path. This leads us to define an equivalence relation on the set of 
placements, two placements being equivalent if we can obtain one from the other 
by moving the blank along the snaking path. Each equivalence class is called a 
configuration, and contains 16 placements, one for each cell the blank can occupy. 
If block i occupies cell j and the blank occupies a higher numbered cell, then we
say block i is in slot j; otherwise it is in slot $j - 1$. Refer to Figure 3 for an
example. All placements in a given configuration have the 15 blocks in the same
slots, so we can denote a configuration by $[a_1, \dots, a_{15}]$, where $a_i$ is the slot that
block i occupies in the configuration.

Every move of the blank block effects a permutation on the slots occupied by
the blocks. For example, moving the blank from cell 10 to cell 15 causes the
permutation (10, 11, 12, 13, 14) because the block originally in cell 15 (slot 14) is
moved to cell 10 (which becomes slot 10) and the blocks in cells 11 through 14 are
bumped up one slot. A configuration $[a_1, \dots, a_{15}]$ subjected to the permutation $\sigma$
is transformed into the configuration $[a_1, \dots, a_{15}]\sigma = [a_1\sigma, \dots, a_{15}\sigma]$; since our
permutations act on the right, we multiply them left to right. See Figure 3 for
an example.
Figure 3. The placement shown here corresponds to the configuration C = [1, 2, 3, 4, 8, 7, 6,
5, 14, 12, 13, 10, 15, 11, 9]. Since the initial placement of Figure 1 corresponds to the configuration
I = [1, 2, 3, 4, 8, 7, 6, 5, 9, 10, 11, 12, 15, 14, 13], subjecting the initial configuration to the permutation
$\sigma = (9, 14, 11, 13)(10, 12)$ yields C. This is an even permutation, so by Theorem 3, C is obtainable
from I.
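The computation in the caption of Figure 3 can be replayed directly. This is a small sketch, assuming configurations are stored as Python lists of slots indexed by block and that a permutation is given as a list of cycles applied left to right; the helper name `apply_cycles` is hypothetical.

```python
def apply_cycles(slot, cycles):
    """Send a slot through a product of cycles, multiplying left to right."""
    for cycle in cycles:
        if slot in cycle:
            slot = cycle[(cycle.index(slot) + 1) % len(cycle)]
    return slot


# Initial configuration I and the permutation from the caption of Figure 3.
I = [1, 2, 3, 4, 8, 7, 6, 5, 9, 10, 11, 12, 15, 14, 13]
sigma = [(9, 14, 11, 13), (10, 12)]

# Acting on the right: block i moves from slot a_i to slot a_i * sigma.
C = [apply_cycles(a, sigma) for a in I]

# A cycle of length k is a product of k - 1 transpositions.
sigma_is_even = sum(len(cycle) - 1 for cycle in sigma) % 2 == 0
```

Here `C` reproduces the configuration stated in the caption, and `sigma_is_even` confirms that a 4-cycle times a transposition is even.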


Let $\sigma_{i,j}$ denote the permutation achieved by moving the blank from cell i to
cell j. Then clearly $\sigma_{i,i+1}$ is the identity, and $\sigma_{j,i} = \sigma_{i,j}^{-1}$. This leaves 9
permutations for us to work out. These are tabulated in Table 1. The key point is that one
can move the blank along the snaking path of Figure 2 to any cell without changing
the configuration. Therefore, the first nine permutations listed in Table 1 and
their inverses may be applied in any order, so the problem reduces to identifying
the subgroup of $S_{15}$ (the symmetric group on the 15 slots) generated by
these permutations. We prove that these permutations generate $A_{15}$ (all even
permutations).


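TABLE 1. A summary of all possible permutations of slots attained by moving the blank block.
Moving the blank from cell i to cell j effects the permutation $\sigma_{i,j}$.

$\sigma_{1,8} = (1, 2, 3, 4, 5, 6, 7)$
$\sigma_{2,7} = (2, 3, 4, 5, 6)$
$\sigma_{3,6} = (3, 4, 5)$
$\sigma_{5,12} = (5, 6, 7, 8, 9, 10, 11)$
$\sigma_{6,11} = (6, 7, 8, 9, 10)$
$\sigma_{7,10} = (7, 8, 9)$
$\sigma_{9,16} = (9, 10, 11, 12, 13, 14, 15)$
$\sigma_{10,15} = (10, 11, 12, 13, 14)$
$\sigma_{11,14} = (11, 12, 13)$
$\sigma_{j,i} = \sigma_{i,j}^{-1}$ for all relevant $j > i$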
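The entries of Table 1 can be regenerated by simulating each vertical move on the tray. The sketch below assumes the snake numbering of Figure 2 (row 1 left to right, row 2 right to left, and so on) and the slot rule of Section 2; the function names `snake_cell`, `slot_map`, and `cycle_map` are illustrative.

```python
def snake_cell(row, col):
    """Snake number (1..16) of the cell at 0-based (row, col) in the 4x4 tray."""
    return 4 * row + (col + 1 if row % 2 == 0 else 4 - col)


def slot_map(blank_from, blank_to):
    """Simulate one move; return {old slot: new slot} for the slots that change."""
    board = {cell: cell for cell in range(1, 17)}  # block c sits in cell c ...
    board[blank_from] = 0                          # ... except for the blank (0)

    def slots(b):
        blank = next(cell for cell in b if b[cell] == 0)
        return {b[cell]: (cell if cell < blank else cell - 1)
                for cell in b if b[cell] != 0}

    before = slots(board)
    board[blank_from], board[blank_to] = board[blank_to], 0
    after = slots(board)
    return {before[blk]: after[blk] for blk in before if before[blk] != after[blk]}


def cycle_map(cycle):
    """The same kind of map for a cycle written as a tuple."""
    return {x: cycle[(k + 1) % len(cycle)] for k, x in enumerate(cycle)}


# Horizontal neighbours are consecutive on the snake, so only vertical
# neighbours can give a non-identity permutation.
table = {}
for row in range(3):
    for col in range(4):
        i, j = sorted((snake_cell(row, col), snake_cell(row + 1, col)))
        if j - i > 1:
            table[(i, j)] = tuple(range(i, j))  # the cycle (i, i+1, ..., j-1)

matches = all(slot_map(i, j) == cycle_map(c) for (i, j), c in table.items())
```

The dictionary `table` ends up with nine entries, and `matches` records that the simulated slot permutation agrees with the cycle $(i, i+1, \dots, j-1)$ in every case.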


Lemma 1. For $n \ge 3$ the 3-cycles generate $A_n$.


Proof: By definition, all elements of $A_n$ can be written as a product of an
even number of transpositions. If a, b, c, and d are distinct, then (a, b)(c, d) =
(a, b, c)(a, d, c), (a, b)(b, c) = (a, c, b), and (a, b)(a, b) = id. Hence every product
of two transpositions is a product of 3-cycles. ∎
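The three identities in the proof are easy to confirm by direct computation. This sketch assumes the article's convention that permutations act on the right and are multiplied left to right; `compose` is a hypothetical helper that returns the images of 1, ..., n.

```python
def compose(*cycles, n=6):
    """Multiply cycles left to right; return the images of 1..n as a tuple."""
    images = []
    for x in range(1, n + 1):
        for cycle in cycles:
            if x in cycle:
                x = cycle[(cycle.index(x) + 1) % len(cycle)]
        images.append(x)
    return tuple(images)


a, b, c, d = 1, 2, 3, 4

# Two disjoint transpositions: (a, b)(c, d) = (a, b, c)(a, d, c).
disjoint_ok = compose((a, b), (c, d)) == compose((a, b, c), (a, d, c))
# Two transpositions sharing one point: (a, b)(b, c) = (a, c, b).
overlap_ok = compose((a, b), (b, c)) == compose((a, c, b))
# The same transposition twice is the identity.
square_ok = compose((a, b), (a, b)) == compose()
```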


For $n \ge 5$, Lemma 1 also follows directly from the fact that $A_n$ is simple, since
the set of 3-cycles is closed under conjugation. Let us call a 3-cycle consecutive if it
is of the form (k, k + 1, k + 2).


Lemma 2. For $n \ge 3$, the consecutive 3-cycles $\{(1, 2, 3), (2, 3, 4), \dots, (n - 2, n - 1, n)\}$
generate $A_n$.


Proof: Since the 3-cycles generate $A_n$, it suffices to show that the consecutive
3-cycles generate all 3-cycles. This is trivial for n = 3. For $n \ge 4$ we see by
induction that we can generate all 3-cycles not containing both 1 and n. To
generate (1, x, n), let $y \in \{1, \dots, n\} \setminus \{1, x, n\}$. Then (1, x, n) = (y, x, n)(1, x, y).
Of course, $(1, n, x) = (1, x, n)^{-1}$. ∎


Theorem 3. The cycles listed in Table 1 generate $A_{15}$.


Proof: Since all the cycles have odd length, they are even permutations, so they generate a
subgroup of $A_{15}$. Note that for any permutation $\sigma$ we have $\sigma^{-1}(a_1, \dots, a_k)\sigma =
(a_1\sigma, \dots, a_k\sigma)$. Thus,


$(1, 2, \dots, 7)^{-n}(3, 4, 5)(1, 2, \dots, 7)^{n}$ yields (1, 2, 3), ..., (5, 6, 7);
$(5, 6, \dots, 11)^{-n}(7, 8, 9)(5, 6, \dots, 11)^{n}$ yields (5, 6, 7), ..., (9, 10, 11); and
$(9, 10, \dots, 15)^{-n}(11, 12, 13)(9, 10, \dots, 15)^{n}$ yields (9, 10, 11), ..., (13, 14, 15)


as n assumes the values −2, −1, 0, 1, and 2. This constitutes all consecutive
3-cycles in $S_{15}$, so by Lemma 2 it generates $A_{15}$. ∎
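The conjugations used in the proof can be carried out explicitly. The following sketch assumes permutations of the 15 slots are stored as dictionaries and multiplied left to right, as in the article; the helper names `from_cycle`, `mul`, `inv`, and `power` are illustrative.

```python
N = 15


def from_cycle(cycle):
    """Permutation of 1..N, as a dict, given by a single cycle (possibly empty)."""
    perm = {x: x for x in range(1, N + 1)}
    for k, x in enumerate(cycle):
        perm[x] = cycle[(k + 1) % len(cycle)]
    return perm


def mul(p, q):
    """Left-to-right product: apply p first, then q."""
    return {x: q[p[x]] for x in p}


def inv(p):
    return {y: x for x, y in p.items()}


def power(p, k):
    base = p if k >= 0 else inv(p)
    result = from_cycle(())
    for _ in range(abs(k)):
        result = mul(result, base)
    return result


pairs = [
    (tuple(range(1, 8)), (3, 4, 5)),
    (tuple(range(5, 12)), (7, 8, 9)),
    (tuple(range(9, 16)), (11, 12, 13)),
]

found = set()
all_are_3_cycles = True
for long_cycle, middle in pairs:
    s, t = from_cycle(long_cycle), from_cycle(middle)
    for n in range(-2, 3):
        conj = mul(mul(power(s, -n), t), power(s, n))
        moved = tuple(sorted(x for x in conj if conj[x] != x))
        all_are_3_cycles = all_are_3_cycles and conj == from_cycle(moved)
        found.add(moved)
```

After the loop, `found` holds the triples (k, k + 1, k + 2) moved by each conjugate, which should cover k = 1, ..., 13.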


Thus, given any two placements $Pl_1$ and $Pl_2$ belonging to configurations $Cf_1$
and $Cf_2$, respectively, $Pl_2$ is obtainable from $Pl_1$ if and only if $Cf_2$ is an even
permutation of $Cf_1$. Stated directly in terms of the placements, we see that if $Pl_1$
and $Pl_2$ have the blank in the same cell then $Pl_2$ is obtainable from $Pl_1$ if and only
if $Pl_2$ is an even permutation of the 15 numbered blocks in $Pl_1$. Let n be the
number of moves the blank cell in $Pl_1$ is away from the blank cell in $Pl_2$. Since
each move of the blank block causes a transposition of two blocks, then for n odd
(respectively even) $Pl_2$ is obtainable from $Pl_1$ if and only if $Pl_2$ is an odd
(respectively even) permutation of the 16 blocks in $Pl_1$.
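The criterion just stated translates into a short test. This is a sketch under the assumption that a placement is a tuple of 16 entries read row by row with 0 for the blank; the names `parity` and `obtainable` are hypothetical, and the distance between blank cells is taken to be the number of rows plus the number of columns separating them.

```python
def parity(perm):
    """0 if the permutation of 0..n-1 is even, 1 if it is odd."""
    seen = [False] * len(perm)
    transpositions = 0
    for first in range(len(perm)):
        j, length = first, 0
        while not seen[j]:
            seen[j] = True
            j = perm[j]
            length += 1
        if length:
            transpositions += length - 1
    return transpositions % 2


def obtainable(p1, p2):
    """True if placement p2 can be reached from placement p1 by legal moves."""
    cell_in_p1 = {block: cell for cell, block in enumerate(p1)}
    # Permutation of the 16 cells relating the two placements (blank included).
    perm = [cell_in_p1[block] for block in p2]
    b1, b2 = p1.index(0), p2.index(0)
    distance = abs(b1 // 4 - b2 // 4) + abs(b1 % 4 - b2 % 4)
    return parity(perm) == distance % 2


start = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 0)
loyd = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 14, 0)
one_move = (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 0, 15)
```

With these definitions, `obtainable(start, loyd)` is `False`, while `obtainable(start, one_move)` is `True`, since a single move is an odd permutation paired with an odd blank distance.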


3. GENERALIZATIONS. What follows is, in some sense, the broadest generaliza- 
tion of the 15-puzzle. Given any connected graph on n vertices, we can label the 
vertices with n labels, one of which we call the blank label. Each move consists of 
interchanging the blank label with the label on an adjacent vertex. We then ask 
which of the n! labelings may be obtained from a given initial labeling through a
sequence of moves. More precisely, we ask what permutations of the $n - 1$
ordinary labels (a subgroup of $S_{n-1}$) can be obtained by a sequence of moves that
returns the blank to its original vertex v (since the subgroups obtained for
different choices of v are isomorphic). The 15-puzzle is a special instance of this,
corresponding to the graph $P_4 \times P_4$ (the cartesian product of the path on four
vertices with itself) depicted in Figure 4. The vertices correspond to cells, the 
labels (not depicted) correspond to blocks, and the edges show which cells 
are adjacent. 

The crux of the method presented in Section 2 lies in inducing equivalence 
classes and defining slots by the position of the blank along a hamiltonian path 
(a path that visits every vertex of the graph exactly once). The method is applicable 
to any graph containing a hamiltonian path, using any such path. Thus, for the 
15-puzzle we could have used a spiral instead of the serpentine pattern of Figure 2. 
Another example is the Petersen graph. Numbering the vertices as in Figure 5, we 
Figure 4. The graph $P_4 \times P_4$.




Figure 5. For the famous Petersen graph, each labeling is obtainable from every other by a sequence of 
legal moves. The vertices are numbered to indicate a hamiltonian path. 


see that our desired group is generated by the permutations $\sigma_{i,j}$ arising from the
six edges {i, j}, i < j, of the graph that do not lie on the hamiltonian path,
where $\sigma_{i,j} = (i, i + 1, \dots, j - 1)$ is the permutation of slots effected by moving the
blank label from vertex i to vertex j. Some calculation shows that the group
generated is all of $S_9$; [15] explains why this is no coincidence.

We now discuss the general case, where the graph may or may not contain a 
hamiltonian path. If the graph contains a cut vertex v then none of the labels other 
than the blank may be moved across v, so the problem decomposes into two parts. 
Thus, it suffices to consider graphs containing no cut vertices. 

In [15], R. M. Wilson solves this case completely. Wilson’s amazing result is that
with the exception of cycles $C_n$ and the graph $\theta_0$ depicted in Figure 6, the group
contains $A_{n-1}$. Clearly the group contains an odd permutation if and only if the
graph contains an odd cycle, that is, the graph is not bipartite. So for bipartite
graphs the group is exactly $A_{n-1}$, and otherwise it is all of $S_{n-1}$. Thus, aside from
the two exceptional cases, either exactly half or all of the n! labelings are
obtainable, depending on whether or not the graph is bipartite. For $\theta_0$, the desired
Figure 6. The graph $\theta_0$.


group is $PGL_2(\mathbb{Z}/5\mathbb{Z})$ acting on the projective line over $\mathbb{Z}/5\mathbb{Z}$ (a group of order
120 acting 3-transitively on a set of six elements), yielding six inequivalent labelings.
For $C_n$ the group is $\langle (1, 2, \dots, n - 1) \rangle$, yielding (n − 2)! inequivalent labelings.
The existence of such a simple complete characterization is surprising.
However, Wilson’s proof, while elegant, requires considerably more sophisticated 
mathematics than the simple and elementary proof provided here for the special 
case of the 15-puzzle. 
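For very small graphs, the counts described in this section can be confirmed by exhaustive search. The sketch below is a brute-force check, not Wilson's method: it assumes vertices are numbered 0, ..., n − 1, that label 0 is the blank, and that a labeling is a tuple giving the label on each vertex. The function name `reachable_labelings` and the two example graphs (a 5-cycle and the 2 × 3 grid, which is bipartite) are illustrative choices.

```python
from collections import deque
from math import factorial


def reachable_labelings(n, edges):
    """Count labelings reachable from the labeling (0, 1, ..., n-1) by legal moves."""
    neighbors = {v: [] for v in range(n)}
    for u, v in edges:
        neighbors[u].append(v)
        neighbors[v].append(u)
    start = tuple(range(n))
    seen = {start}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        blank = state.index(0)
        for v in neighbors[blank]:
            nxt = list(state)
            nxt[blank], nxt[v] = nxt[v], nxt[blank]  # swap blank with a neighbor
            nxt = tuple(nxt)
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return len(seen)


# The cycle C_5: the cyclic order of the four ordinary labels never changes.
cycle5 = [(i, (i + 1) % 5) for i in range(5)]
# The 2 x 3 grid: vertices 0-1-2 on top, 3-4-5 below.
grid2x3 = [(0, 1), (1, 2), (3, 4), (4, 5), (0, 3), (1, 4), (2, 5)]

cycle_classes = factorial(5) // reachable_labelings(5, cycle5)
grid_reachable = reachable_labelings(6, grid2x3)
```

For the 5-cycle, `cycle_classes` equals (5 − 2)! = 6, and for the bipartite grid `grid_reachable` is 360, exactly half of 6!.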
Effects of Calculus Reform:
Local and National 


James F. Hurley, Uwe Koehn, and Susan L. Ganter


1. INTRODUCTION. Over the past decade, the large-scale effort to reform 
calculus instruction has been a prominent feature of mathematics education. It
arose as an energetic response to criticism of the calculus curriculum that culminated
in the 1987 conference Calculus for a New Century. Substantial support from
the National Science Foundation stimulated development and implementation of
new calculus curricula at many institutions, including the University of Connecticut.

The instructional practices of calculus-reform programs differ markedly from 
those that had persisted for decades (some would say, centuries). It is only natural 
for faculty to question whether the new modes really improve the approach that in 
their own education worked successfully. Some observe little if any improvement in 
conceptual understanding among students from reform courses, and even complain 
of lessened ability to use computational techniques. They contend (for instance, see 
[29]) that much of what passes for reform really amounts to “dumbing down” a 
formerly demanding but honest grounding in the power of calculus to a sloppy, 
imprecise, even misleading shadow of the true nature of the subject. Reformers 
counter that rote “plug-and-chug” hand symbolic calculation is as intellectually 
stultifying to teach as to learn, and that their teamwork-fostering technological 
tools and compelling connections to a broad spectrum of “real-life” issues moti- 
vate students to become active learners. They assert that their students emerge 
from calculus with superior understanding and greater capacity to use its methods 
successfully, and hence better prepared to complete degrees in mathematics, 
science, and engineering. 

Passionate advocacy of such positions can make for lively lunch-room entertain- 
ment, but provides little objective basis for shaping an optimal calculus curriculum. 
The present paper examines the impact of calculus reform at one medium-sized 
state university. It also considers the effect of reform at several other institutions.
It concludes with a discussion of broader national implications of those outcomes. 

The following section describes the features of Connecticut’s reform project and 
includes some representative examples of its activity. For more details, consult [14], 
[15], and the instructor guides for the latter [16]. Ensuing sections discuss data 
from common final examinations at Connecticut, results of a five-year longitudinal 
study of students who took traditional and reform versions of calculus during the 
project’s first year, and results from similar analyses at other sites. The concluding 
section discusses the national picture, and the kind of further studies appropriate 
to appraising the impact of calculus reform. 


2. THE UNIVERSITY OF CONNECTICUT PROJECT. In 1989, with support 
from the National Science Foundation Instrumentation and Laboratory Improve- 
ment Program, the University’s Research Foundation, and the State of Connecticut 
High-Technology Program, the Department of Mathematics opened a new computer
laboratory. This facility made it possible to expand computer integration
from the honors sequence to main-track calculus. Such expansion was a major 
recommendation of the MAA Committee on Calculus Reform and The First Two 
Years (CRAFTY) after its site visit the previous year. CRAFTY examined the 
five-year-old approach in the honors course as one early model for reforming 
calculus instruction. A brief description of the resulting project follows. 

One class hour per week became a computer-laboratory period, and an
existing problem-discussion period evolved into a group problem-solving session.
In both those settings, students work in self-selecting groups of three to five, without
referring to books or notes. In each, the instructor provides some guidance to
student exploration and problem-analysis. In the problem hour, students attack a 
range of conceptual and computational questions, many of which are typical 
examination problems but a significant number of which are more challenging. In 
the computer laboratory, students pursue investigation of (generally new) topics by 
using local programs in the way their laboratory manuals suggest. The first 
semester, those programs are graphical and numerical True BASIC routines. 
Second-semester activity includes Maple worksheets and Mathematica notebooks. 
The remaining two weekly meetings resemble traditional lectures, but students 
have a more active role in discussion of examples. They often work together (with 
the aid of graphing calculators) to analyze and investigate problems or properties, 
and suggest how the instructor should proceed. 

At the very first laboratory session, students encounter the function f with 
formula 

$$f(x) = \begin{cases} x \sin(1/x) & \text{for } x \ne 0 \\ 0 & \text{for } x = 0 \end{cases} \qquad (1)$$


A zooming program lets them examine the graph of f near the origin, and leads 
them to conjecture that it has a limit there; later, they use the sandwich theorem 
for limits to justify that conclusion. In a subsequent problem-solving hour, they 
work with similar functions for which graphing calculators provide enough infor- 
mation to suggest continuity or discontinuity. Again, they need to apply appropri- 
ate theory to support their conclusions. 
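The zooming experiment can be imitated numerically. This is a minimal sketch, not the project's True BASIC program: it samples the function of formula (1) on shrinking windows [−h, h] and records the largest value of |f|, which the sandwich inequality −|x| ≤ x sin(1/x) ≤ |x| bounds by h. The window sizes and sample count are arbitrary assumptions.

```python
import math


def f(x):
    """The function of formula (1): x sin(1/x) away from 0, and 0 at 0."""
    return x * math.sin(1.0 / x) if x != 0 else 0.0


zoom = []
for k in range(1, 7):
    h = 10.0 ** (-k)  # half-width of the viewing window
    xs = [h * (i / 1000.0) for i in range(-1000, 1001)]
    worst = max(abs(f(x)) for x in xs)
    zoom.append((h, worst))

# Each window's largest |f| is at most its half-width.
squeezed = all(worst <= h for h, worst in zoom)
```

The shrinking bounds in `zoom` suggest the limit 0 at the origin; the sandwich theorem is what turns that suggestion into a proof.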

Local linearity plays a central role in discussion of differentiation, and the graph
of the function f with formula (1) indicates its non-differentiability at x = 0.
A group project asks students to investigate the function g in which $x^2 \cos(1/x)$
replaces $x \sin(1/x)$ in (1). Another outgrowth of local linearity is the approach to
implicit differentiation, which uses Euler’s method to construct both tables of
values and approximate graphs of functional equations F(x, y) = 0 near a known
(initial-value) point $(x_0, y_0)$. This provides a first experience with numerical
solution of differential equations, as well as a ready tool for sketching the graph of
equations such as $x^{2/3} + y^{2/3} = 1$.
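The Euler-based approach to implicit curves can be sketched as follows. Implicit differentiation of F(x, y) = 0 gives dy/dx = −F_x/F_y wherever F_y ≠ 0, and Euler's method follows that slope from the known point. The unit circle used here is a stand-in chosen for illustration, not an example from the project, and the function name `euler_implicit` is hypothetical.

```python
import math


def euler_implicit(Fx, Fy, x0, y0, x_end, steps):
    """Tabulate approximate points on F(x, y) = 0 starting from (x0, y0).

    Fx and Fy are the partial derivatives of F; Fy must stay nonzero
    along the part of the curve being traced.
    """
    h = (x_end - x0) / steps
    points = [(x0, y0)]
    x, y = x0, y0
    for k in range(steps):
        slope = -Fx(x, y) / Fy(x, y)  # dy/dx from implicit differentiation
        y += h * slope
        x = x0 + (k + 1) * h
        points.append((x, y))
    return points


# Illustrative curve: F(x, y) = x^2 + y^2 - 1, traced from (0, 1) to x = 0.5.
points = euler_implicit(lambda x, y: 2 * x, lambda x, y: 2 * y,
                        0.0, 1.0, 0.5, 500)
x_last, y_last = points[-1]
exact = math.sqrt(1 - 0.5 ** 2)
```

The list `points` plays the role of the table of values, and comparing `y_last` with `exact` gives a feel for the accuracy of the approximate graph.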

Initial exposure to area and definite integration occurs in the laboratory,
through interaction with a graphical/numerical program that presents the interactive
screen in Figure 1. The goal is both to foster conceptualization of $\int_a^b f(x)\,dx$ as
a limit of sums and to provide a means of computing the area under the graph of a
continuous nonnegative function f over an interval [a, b]. Numerical integration
thus emerges as a natural and effective means for evaluating definite integrals.
This also sets the scene for the fundamental theorem as a remarkable link between

Figure 1. Program screen for $f(x) = x^2$ with n = 40; the sum of the areas of the rectangles is .333281.


a limiting process and antidifferentiation. Students use numerical summation to
estimate a definite integral of $1/(1 + \cos x)$, and the corresponding laboratory project considers the
accuracy of the right-endpoint approximation $R_n(f)$ and midpoint approximation
$M_n(f)$. Students investigate $|I - R_n(f)|/h$, $|I - M_n(f)|/h$, and $|I - M_n(f)|/h^2$,
where $I = \int_a^b f(x)\,dx$ is known, and discuss the order-of-magnitude accuracy those
ratios suggest.
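The error ratios in this project are simple to compute. The sketch below assumes the test case f(x) = x² on [0, 1], for which I = 1/3; with n = 40 the midpoint sum comes to .333281 to six places, which agrees with the value on the screen in Figure 1 if that screen uses midpoint rectangles (an assumption). The function names are illustrative.

```python
def right_sum(f, a, b, n):
    """Right-endpoint approximation R_n(f) on [a, b] with n equal subintervals."""
    h = (b - a) / n
    return h * sum(f(a + k * h) for k in range(1, n + 1))


def midpoint_sum(f, a, b, n):
    """Midpoint approximation M_n(f) on [a, b] with n equal subintervals."""
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) for k in range(n))


def square(x):
    return x * x


a, b, n = 0.0, 1.0, 40
h = (b - a) / n
I = 1.0 / 3.0  # exact value of the integral of x^2 over [0, 1]

R = right_sum(square, a, b, n)
M = midpoint_sum(square, a, b, n)

ratio_R_h = abs(I - R) / h        # stays near 1/2: error of order h
ratio_M_h = abs(I - M) / h        # shrinks as n grows
ratio_M_h2 = abs(I - M) / h ** 2  # stays near 1/12: error of order h^2
```

Repeating the computation for larger n shows `ratio_R_h` and `ratio_M_h2` settling down while `ratio_M_h` tends to 0, which is the order-of-magnitude contrast the project asks students to discuss.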

In the second semester, log-log and semilog plotting illustrate the usefulness of 
logarithmic and exponential functions in modeling, as most students are actually 
using those tools in their chemistry laboratory work. Mathematica’s Apart com- 
mand removes the algebraic tedium from complex partial-fraction decomposition. 
Numerical summation appears again, to highlight a perceptible difference between
the series $\sum_{n=1}^{\infty} 1/n$ and $\sum_{n=1}^{\infty} 1/n^2$. Laboratory activity explores $s_{2n} - s_n$, and the
ensuing project asks for an explanation of why for the harmonic series that
difference not only does not tend to 0 but in fact seems to converge to ln 2.
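The laboratory observation is easy to reproduce. In this sketch, `tail_difference` (a hypothetical name) computes $s_{2n} - s_n$ for a series given by its term function; the choice n = 1000 is arbitrary. For the harmonic series the difference equals $\sum_{k=1}^{n} \frac{1}{n+k} = \frac{1}{n}\sum_{k=1}^{n} \frac{1}{1 + k/n}$, a Riemann sum for $\int_0^1 dx/(1+x) = \ln 2$.

```python
import math


def partial_sum(term, n):
    """s_n: the sum of term(1), ..., term(n)."""
    return sum(term(k) for k in range(1, n + 1))


def tail_difference(term, n):
    """s_{2n} - s_n: the sum of terms n + 1 through 2n."""
    return partial_sum(term, 2 * n) - partial_sum(term, n)


def harmonic(k):
    return 1.0 / k


def inverse_square(k):
    return 1.0 / k ** 2


n = 1000
harmonic_gap = tail_difference(harmonic, n)       # close to ln 2
square_gap = tail_difference(inverse_square, n)   # close to 0
ln2 = math.log(2)
```

A partial-sum difference that stays near ln 2 rules out convergence of the harmonic series, while the vanishing difference for $\sum 1/n^2$ is consistent with convergence.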

Years of teaching experience convinced the project faculty that student perceptions
of the importance of material derive in large measure from testing. That
prompted them to include conceptual (that is, non-computational) questions regu- 
larly not only on discussion worksheets, homework, and group projects but also on 
quizzes and examinations. For example, the first hour exam in 1989 contained the 
following true-false questions. 


• If $\lim_{x \to c} f(x) = L$ then $\lim_{x \to c^-} f(x) = L$
• If f is continuous at x = c then it must be differentiable there
• If f is continuous on J = (2, 3) then it must have a maximum value on J
• If f is not continuous at x = c then it cannot be differentiable there
• If f is not defined at x = c then $\lim_{x \to c} f(x)$ cannot exist
The common final examination that term showed the graph of a nonnegative
function f on a grid over an interval [a, b] and provided five intervals from which
the students had to choose the one containing $\int_a^b f(x)\,dx$. A bonus question asked
for a function f and a number a such that $\int_a^x f(t)\,dt = \tan x - 1$. The next year’s
first hour exam asked whether $\lim_{x \to 0} x^2 \cos(1/x)$ exists, and for justification of
the answer. The 1991 final examination provided grids on which to sketch the
graphs of a function f and its derivative given information about the signs of f’
and f”, limiting information for f, and some function values of f and f’.


3. COMMON FINAL EXAMINATION PERFORMANCE. Every semester from 
Fall, 1989 through Spring, 1994, all first-year calculus students at the University’s 
main campus—whether in experimental or traditional sections—took a common 
final examination. There was also a common text for all sections: [12] for the first 
three years and then [13]. With the exception of Fall, 1991, when an instructor of a 
traditional section prepared the common final examination alone, members of the 
calculus-reform team who taught experimental sections participated with the other 
instructors in making up the final examination. There were never computer-specific 
questions, but there were always both conceptual questions like those in Section 2 
and traditional computational and procedural items. In an effort to measure any 
possible trade-off between conceptual understanding and computational skills, the
second-semester examinations tended to put greater emphasis on the latter to 
balance the heavier conceptual flavor of the first-semester examinations. Table 1 


TABLE 1

Semester       Standard Section Mean    Experimental Section Mean    Overall Standard Deviation
1989 Fall
1990 Spring
1990 Fall
1991 Spring
1991 Fall
1993 Fall
1994 Spring


presents the data from the results of the common examinations. Although the size 
of the difference is somewhat variable, for every semester the mean score in the 
experimental sections was higher than that in the standard sections. This suggests
that better conceptual understanding in the project sections did not come
at the expense of hand-computational skills.

For 1989-90, part of the analysis for a longitudinal study (see Section 4) showed 
no significant difference between students in the experimental and traditional 
sections in terms of such success predictors as high-school rank and SAT scores. 
Such analysis was not carried out for intervening years, but in [5] Mary Ann 
Connors of the University of Massachusetts reports on a comprehensive analysis of 
the final examination data from 1993-94. That was the first year in which the
computer-integrated calculus sequence existed as a separate course (instead of 
experimental sections within a single first-year calculus sequence). That Fall the 
two courses were of nearly equal size, a very different distribution from that in 
Fall, 1989, when 90% of the students were in traditional sections. Enrollment in 
the two versions was not random: students were free to elect either, and, although 
rare, switching between them was possible. Most enrollees were entering freshmen,
and despite distribution of full descriptions of the two variants of the course,
at the first meetings most students were not aware of any format differences. 
Analysis of SAT Mathematics and Verbal scores again showed no significant 
differences between the two populations. Males entered both courses with signifi- 
cantly higher SAT math scores than the females. 

Connors determined that the difference in final-examination performance was 
significant (p < 0.034), and that the performance of both males and females from 
the computer-integrated course was superior to that of their counterparts from the 
traditional version. Her analysis further revealed that in the computer-integrated 
course the mean score for female students (61%) was almost identical to the 
male-student mean (61.4%). Moreover, those female students had a mean SAT 
Math score (576) lower than both the females (596) and the males (601) in the 
traditional course. Yet on the final examination they outperformed both those 
groups, whose respective mean scores were 56.6 and 57.6. As Connors observed, 
this suggests that the female students benefited more from the computer- 
integrated experience than did the males. Analysis of performance within the 
respective courses further supports that. Male students in the computer-integrated 
course, for instance, had a mean SAT mathematics score of 625, but a mean final
examination score just four-tenths of one percent higher than the female mean. 


4. CALCULUS AS A PUMP TO TECHNICAL MAJORS. Consistently higher 
performance of students from reformed sections on concept-rich common final 
examinations suggested meaningful short-term benefit from the new approach, but 
to gauge the longer-term effect of the new instructional mode the authors focused 
on a basic theme of Calculus for a New Century. The welcoming message [21] from 
Dr. Frank Press, President of the National Academy of Sciences, to Calculus for a 
New Century cited the role of calculus as a giant filter that knocked many students, 
especially minorities and women, out of the pipeline to the technical work force of 
the 21st century. In what was to become the rallying cry of calculus reform, 
Dr. Robert White, President of the National Academy of Engineering [28] fol- 
lowed those remarks with a call to transform calculus from a filter into a pump. 

To measure persistence and success in majors in the mathematical, physical, 
and engineering sciences, the authors first surveyed the requirements of all the 
science and engineering major programs at the University. From that study they 
compiled a set of 32 key courses with first-year calculus as prerequisite. Those 32 
courses are essential to completion of majors in the fields of interest, and they 
require students to use the quantitative, analytical, and problem-solving skills that 
calculus aims to impart. The post-calculus careers of all students who enrolled in 
first-semester calculus in the Fall of 1989 (m = 579) were tracked for the subse- 
quent five academic years. The population was divided into two cohorts for the 
purposes of this analysis: those who took at least one semester of computer- 
integrated calculus and those who took two semesters of traditional calculus during 
1989-90. The data provided a list of the key courses in which each student 
subsequently enrolled. Among all students who took first-year calculus in 1989-90, 
approximately 59% of males took at least one such course, and approximately 43% 
of females did so. A detailed analysis of each student’s performance in all such 
later courses was performed. A general linear model was fitted to the number of 
subsequent courses, and the results were checked for consistency with other 
methods (Mann-Whitney-Wilcoxon and Savage) and by means of transformations 
such as square-root and logarithmic. 
Along with Mathematics SAT score, enrollment in the project’s sections of calculus
was one of only two factors that correlated significantly with persistence in 
technical majors among all students and among male students. For female stu- 
dents, it was the only statistically significant factor the study found to correlate 
with such persistence. 

SAT scores, high-school class ranks, and other predictors of success in technical 
majors were compared both for students who took one or two semesters of the 
experimental version of calculus and for those who took two semesters in 
the traditional format. There were no significant differences in SAT scores 
or class rank, although in both categories the mean was slightly higher for students 
in the experimental sections. However, there was a significant difference (t-test 
p < 0.0162) between the two cohorts in the number of key post-calculus major 
courses completed. Among students in the traditional sections, the mean number 
of such courses was 3.50. Among students in the computer-integrated sections, it 
was 4.95, that is, more than 40% higher. Under the general linear models 
procedure with SAT scores and high-school rank as covariates, the p-value was 
0.0580. 
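
The "more than 40% higher" figure follows directly from the two reported means. A minimal sketch of that arithmetic, with variable names that are illustrative and not taken from the study:

```python
# Mean number of key post-calculus courses completed, as reported above.
mean_traditional = 3.50
mean_computer_integrated = 4.95

# Relative increase of the computer-integrated cohort over the traditional one.
relative_increase = (mean_computer_integrated - mean_traditional) / mean_traditional

print(f"relative increase: {relative_increase:.1%}")
```

The ratio works out to roughly 41%, which agrees with the statement in the text.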

The mean grade earned in those courses was not significantly different between 
the two groups, indicating that students from the traditional sections who persisted 
in technical majors seem to have acquired adequate preparation from their 
first-year calculus courses. 

Further analysis revealed additional interesting information. That study consid- 
ered several factors, including SAT Verbal score, SAT Math score, total SAT 
score, high-school rank, and various combinations of two or more of those in 
relation to post-calculus persistence in technical fields. For all students, the SAT 
Math score was a very significant predictor of persistence in such majors 
(p < 0.0001). However, restricted to females it was not significant. In fact, the only 
significant such factor (p < 0.0267) for them that this study identified was enroll- 
ment in an experimental section of first-year calculus. By contrast, among males 
only the SAT Math score was a significant (p < .0024) factor in persistence. 


5. RESULTS FROM OTHER SITES. Encouraging as the foregoing results at 
Connecticut may be, in themselves they provide information about the experience 
of just a single institution. The authors investigated the question of how common 
the Connecticut experience might be at calculus-reform sites. They solicited data 
from many sources and examined the literature for reports of studies that ad- 
dressed such areas as persistence in technical fields, performance on common 
examinations and in later courses, and similarities and differences in outcomes 
between males and females. 

The institution with the most similar approach to Connecticut’s is Dartmouth, 
where True BASIC originated. While the main thrust of [1] is a description of 
Dartmouth’s approach and presentation of sample materials, it does mention an 
experiment in 1988, a year before Connecticut’s longitudinal study started. Dartmouth 
gave precisely the same traditional final examination in its first-term 
introductory calculus course as it had the previous year, when the degree of 
computer use in the course was substantially lower. It found no differences in 
performance between the two classes, which led to the Hippocratic conclusion that 
at least its computer integration appeared to have done no harm. 

At the United States Naval Academy, a study compared results of common final 
examinations for sections that used the calculus-reform approach of the Harvard 
Consortium to those for sections that used a traditional text and approach [19]. 
Students were randomly assigned to the sections, with no switching possible. The 
final examination included a common block of ten multiple-choice questions of 
both conceptual and computational types. Students from the reform sections 
scored better overall on nine of those ten questions, with statistically significant 
disparities in six of them. There was little or no underperformance on traditional 
computational questions, but significantly better performance on conceptual ques- 
tions. These results are consistent with the conclusion that the reform mode could 
foster better learning, especially of conceptual areas of calculus. 

Baylor University, another reform site that adopted the Harvard Consortium 
approach, also appraised the effect by using a common final examination for 
reform and traditional students [25]. Its analysis incorporated an index that divided 
students by success predictors (SAT Math, ACT Math, and local calculus placement 
exam scores). Even though all common exam questions came from the 
traditional course’s topical coverage and assignments, mean scores among all levels 
of students in reform sections surpassed those of the traditional sections. On only 
two of 20 items was the mean score higher in traditional sections. 

A study at the United States Merchant Marine Academy analyzed grades of 
students in traditional and reform versions of calculus throughout the sequence 
during the years 1990-91 (traditional) and 1991-92 (reform) [22]. Overall, the 
reform-course students earned consistently higher grades than did their predeces- 
sors in the traditional version. For students with weaker SAT and calculus- 
readiness examination scores, that improvement was striking. By contrast, students 
with the best readiness-exam scores actually did slightly worse in reform calculus, 
perhaps reflecting the lessened prominence of algebraic manipulation in the latter. 

The C⁴L (Calculus, Concepts, Computers, Cooperative Learning) program at 
Purdue has used a two-pronged approach to evaluate its impact. Besides qualitative 
research on student learning, the C⁴L project conducted a longitudinal study 
similar to Connecticut’s over the period 1988-91 [23]. Among the variables it 
studied were the number of post-calculus mathematics courses students took and 
the grades they earned. Like Connecticut’s study, it found that project students 
took more of those courses, with no significant differences in grade performance. 

The University of Illinois at Chicago compared grade performance in subse- 
quent technical courses among students who took a traditional version of calculus 
in 1994-95 to that of students who took a reformed version of calculus (again, the 
Harvard approach) in 1995-96. Results of that study showed that students from 
reformed calculus performed significantly better in physics courses taken immedi- 
ately after calculus, with diminishing differences in later courses [2]. 

SUNY Stony Brook compared percents of first-semester students who contin- 
ued into the second semester of calculus before and after adoption of a reformed 
calculus program. Results from the study show significant differences over two 
three-year periods (1988-91 and 1992-95) [20]. In the first three years, the “yield” 
(percent of first-semester students who continued to the second semester) was, 
respectively, 36.4%, 52.6%, and 52.5%. Over the next three years, those figures 
improved to 63.1%, 59.8%, and 63.4%. One caveat: markedly higher course grades 
in the reformed course could be a significant stimulant to the improvement, and it 
is difficult to compare grading standards in the two quite different settings. 
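
To make the size of the change concrete, the six reported yields can be averaged over each three-year period. The sketch below is only an illustration: it uses an unweighted mean because the text does not give cohort sizes, and the variable and function names are chosen here.

```python
# Yield = percent of first-semester students who continued to the second
# semester, as reported for SUNY Stony Brook.
yields_before = [36.4, 52.6, 52.5]  # 1988-91, before the reformed program
yields_after = [63.1, 59.8, 63.4]   # 1992-95, after adoption


def mean(values):
    """Unweighted arithmetic mean (cohort sizes are not given in the text)."""
    return sum(values) / len(values)


mean_before = mean(yields_before)
mean_after = mean(yields_after)

print(f"mean yield before: {mean_before:.1f}%")
print(f"mean yield after:  {mean_after:.1f}%")
print(f"difference: {mean_after - mean_before:.1f} percentage points")
```

On these figures the average yield rises from about 47% to about 62%. The caveat about grading standards applies to this comparison as well.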

Michigan’s reform project (Harvard materials) looked at a similar retention 
pattern. Results showed significant increases the first two years (when project 
section grades were also significantly higher), but a reverse in the third year when 
that grade disparity disappeared [4]. Significant attitudinal differences on the part 
of the students in the reformed version were also identifiable. Those in the reform 
project’s sections showed more positive attitudes about calculus generally and their 
course experience in particular. 

The Duke project (Project CALC) carried out one of the most comprehensive 
assessment studies, which measured attitude, subsequent enrollment patterns, 
retention of knowledge, and other factors. No short summary here can do justice 
to the copious data in that project’s final evaluation [3]. Suffice it to note that 
results of the retention tests show a nearly uniform significant advantage in favor 
of the project students. Their attitudes, problem-solving, and conceptual under- 
standing consistently surpassed those of traditional-section students. However, 
performance on computational skills was inferior, by a margin that fell just short of 
statistical significance. Project students again took significantly more technical 
courses, but students from traditional sections had a slightly higher grade-point 
average in those courses. Significantly more project students than traditional ones 
took more than two post-calculus courses. 

What about the gender-specific aspects of the Connecticut project? The positive 
results for females in the Connecticut study are consistent with reports of similar 
effects in projects that incorporate graphing-calculator technology; see [7] and [18]. 
An interesting question that comes from such data: what aspects of technology 
enhance female learning in quantitative subjects? One possible theory from 
informal discussions is the prominent role of cooperative learning in many calcu- 
lus-reform projects. Computer-laboratory sessions and related projects lend them- 
selves naturally to group work, which several studies (among them [11]) have 
correlated with improved performance by females in science and mathematics 
courses. 

Finally, an earlier article in this MONTHLY reported about calculus reform at 
Oklahoma State University [17]. That study found that—as at the Merchant 
Marine Academy, SUNY Stony Brook, and Michigan—grades in the reform 
version of calculus were higher than in traditional sections, which in the Oklahoma 
State study were contemporaneous. Little difference in subsequent enrollment 
patterns within the calculus/differential equations sequence was observed. However, 
the study did not encompass a sufficiently long time span to measure persistence in 
and completion of required courses for technical majors, as in the Connecticut, 
Purdue, and Duke studies. Its results were nonetheless consistent with those from the 
Merchant Marine Academy study in one respect: lower grades in traditional 
sections of Calculus II for students from reform sections of Calculus I. It reported 
insufficient data for higher-level course performance to permit meaningful conclu- 
sions, but stated that fewer students from the reform sections earned C or better 
grades in differential equations or linear algebra. It concluded that traditional-sec- 
tion students tended overall to do better in subsequent mathematics courses, but 
gave no information about the statistical significance of their performance. 

The preponderance of evidence from the Connecticut study—as well as the 
evidence cited in this section—is consistent with the conclusion that the impact of 
calculus reform has been positive. If the “filter-to-pump” goal is an appropriate 
assessment standard, then the results from Connecticut and other calculus-reform 
sites suggest that calculus with modern computational and pedagogical features 
can promote better end-of-course mastery, significantly improve the flow rate into 
the technical work force, and foster more gender diversity in that flow. Note the 
caution in these conclusions: is consistent with and suggest, rather than imply or 
prove. Unlike mathematics, appraisal of pedagogical change is a highly inexact 
science, lacking not only proofs but even generally accepted rules of inference. 

The positive content of the previous paragraph notwithstanding, it is only fair to 
mention that at many institutions the fruits of calculus reform have not included 
substantially higher performance and persistence [9]. Experimenters whose results 
fall short of hopes and expectations are certainly less likely to write up and 
disseminate accounts of their work than those with more pleasing performance 
data to report. There is also considerable belief in the Hawthorne effect, that is, 
that expenditure of time and attention on a project aiming to improve outcomes 
leads to actual improvements, which to at least some extent result from the 
attention itself rather than the project’s methods. On the other hand, the atmos- 
phere of passionate debate mentioned in Section 1 certainly is receptive to a 
well-designed and well-documented study showing predominantly lower performance 
and/or persistence by students in a reformed approach to calculus. The 
authors are not aware of any such study. 


6. GENERAL NATIONAL IMPACT. On a procedural level, there is clear evi- 
dence of substantial change in the teaching, learning, and testing of calculus since 
the 1980’s. The appearance and spread of graphing calculators and powerful 
desktop, laptop, and now notebook and subnotebook computers has noticeably 
stimulated greater emphasis on the graphical and numerical aspects of calculus. 
The popularity of projects and written reports has also brought a new verbal 
dimension to the subject. Even current editions of texts that early reformers cited 
as contributors to the calculus crisis incorporate (and advertise) numerical, graphi- 
cal, and verbal components. Calculator and computer supplements, which were 
virtually nonexistent at the start of the calculus-reform movement, have now 
become pervasive. It is in fact challenging to identify a current calculus text that in 
1987 would have been labeled “traditional.” 

National examinations reflect national norms about mastery of the subjects they 
test. Prior to the calculus reform movement, the Advanced Placement calculus 
program banned calculators. The current AP exams require them! A similar situa- 
tion will shortly exist in the new Graduate Record Examination’s Mathematical 
Reasoning Test for students planning to pursue graduate study in engineering, the 
physical, mathematical, or computer sciences, economics, or some areas of the life 
sciences [26]. This test presumes a year of calculus, and the questions in its new 
version probe understanding of concepts and their reflection in graphical or 
applied settings, areas of particular prominence in calculus reform. 

The foregoing sections raise a natural question: what national impact has 
calculus reform actually had? The rest of this section addresses that question. 

More than 500 mathematics departments at postsecondary institutions nation- 
wide are currently implementing some level of calculus reform. These “reformed” 
courses enroll approximately 300,000 students each year, about 32% of the total 
national calculus enrollment [27]. Such growth of a movement only a decade old 
suggests that the influence of calculus reform will continue to spread. In fact, many 
institutions are now initiating programs to improve learning in a wide range of 
science, mathematics, engineering, and technology courses. These include projects 
that eliminate the traditional boundaries between these disciplines by means of an 
integrated teaching approach. Such programs, which respond to the new require- 
ments students face in an increasingly technical and multidimensional workplace, 
represent a fundamental change in the philosophy that has long guided the 
structure of undergraduate education. 

At present, there are few studies that document the impact of these efforts on 
student learning, faculty and student attitudes, and the general environment at 
undergraduate institutions. There has been considerable study of student learning 
in calculus, e.g. [6], [8], [9], [10], and [24]. To make meaningful judgments about the 
value of reform efforts, it is necessary to understand not only how students learn, 
but also how different environments influence their ability to learn. More studies 
are essential if the academic community is to understand the implications of this 
change in philosophy on learning within and across disciplines, throughout a 
student’s experience at the undergraduate level, and beyond. 

One such study is part of a larger effort by the National Science Foundation to 
evaluate the impact of reform in science, mathematics, engineering, and technol- 
ogy education at the undergraduate level [9]. It investigated the effect of calculus 
reform on: student achievement, attitudes, and retention; the implementation of 
mechanisms that have shown promise in improving the learning environment (e.g., 
student-centered learning, multiple methods of delivery, and alternative measures 
of assessment); and the general educational environment. Although the study 
yielded mixed results for student achievement, several trends did emerge. For 
example, approximately 50% of the institutions conducting studies on the impact 
of technology reported increases in conceptual understanding, greater facility with 
visualization and graphical representations, and the ability to solve a wider variety 
of more difficult problems, without any loss of computational skills. Another 40% 
reported that students in classes with technology had done at least as well as those 
in traditional courses. The impact of long-term projects and group work on student 
achievement is less clear. However, a pattern was discernible regarding the type of 
student likely to excel in this environment. This includes students who are “above 
average” in mathematics, students who do not perform well on traditional tests, 
and engineering majors. 

Projects and group work also seem to affect grade distribution, although not in 
consistent ways. For example, one institution reported that projects made the final 
grade distribution more bi-polar, with very few “C” students, while another 
reported that projects were “the great equalizer,” with more C’s resulting from the 
subjective nature of grading the projects and concomitant inability to justify grades 
at the extremes. The effect of projects on course grades also seemed to be 
influenced by whether the projects were individual or group: the latter seemed to 
“equalize” the grade distribution more than individual assignments. 

The existence of many common elements throughout the majority of the 
projects implies that the relative success or failure of reform efforts is not 
necessarily dependent solely upon content, but also on how, by whom, and in what 
setting the approach is implemented. Consistent reactions of students from a wide 
variety of institutions point to several key components for a successful reform 
environment. For example, instructors must communicate to students (and other 
faculty) the purpose of the changes being made in the calculus course. This is often 
not as easy as it may seem, because the reasons for the change must strike the 
students as relevant and important to future success. Unfortunately, many students 
believe mathematics is a static list of rules and algorithms to be memorized, a 
barrier to be overcome before they can do “real” work in other disciplines. One of 
the most important roles of reform efforts, then, is to challenge those beliefs and 
help students appreciate the many uses of calculus, both within mathematics and 
in a wide range of other disciplines. Thus far, the most effective means to this 
end still await identification. 

Two major goals of calculus reform have been to revitalize the sequence and to 
generate discussion within the mathematics community about the nature and 
content of the calculus course. The lively debate the introduction cites attests to 
the fact that these goals have indeed been realized! In itself, that constitutes a 
major accomplishment that should not be overlooked. Are the reform efforts in 
fact helping students better understand calculus and appreciate the importance of 
mathematics in their lives? This basic question may never be definitively answered, 
but it cries out for study if calculus courses—and mathematics in general—are to 
remain a vital part of the undergraduate curriculum. 


Some Fundamental Control Theory II: 
Feedback Linearization of Single Input 
Nonlinear Systems 


William J. Terrell 


1. INTRODUCTION. In Part I of this article [12] we characterized the single-input 
single-output linear systems 

\[
x' = Ax + bu(t), \tag{1a}
\]
\[
y = c^T x, \tag{1b}
\]

where $A$ is $n \times n$, $x$ and $b$ are $n \times 1$, and $c$ is $n \times 1$, that can be transformed by 
a nonsingular linear transformation, $z = Tx$, to a linear companion form 


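\[
z' = Pz + du(t) = \begin{bmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & 1 \\ -k_1 & -k_2 & -k_3 & \cdots & -k_n \end{bmatrix} z + \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix} u, \tag{2}
\]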


which has the desirable property of complete controllability. By the use of state 
feedback, $u = Kx$ with $K$ a $1 \times n$ matrix, such systems may be expressed in the 
particularly simple form, $z_1^{(n)} = v$, where $v$ is a new reference input that is 
available for control purposes. 

For convenience we summarize the main result of Part I in Theorem 1; see [12] 
for definitions of the relevant concepts. 


Theorem 1. System (1a) can be transformed by a nonsingular linear transformation, 
$z = Tx$, to the companion system (2), if and only if $\operatorname{rank}\,[\,b \;\; Ab \;\; \cdots \;\; A^{n-1}b\,] = n$ 
((1a) is completely controllable). When this is the case, $T$ is unique and 

\[
T = \begin{bmatrix} \tau \\ \tau A \\ \vdots \\ \tau A^{n-1} \end{bmatrix}, \tag{3}
\]

where $\tau$ is the unique solution of 

\[
\tau\,[\,b \;\; Ab \;\; \cdots \;\; A^{n-1}b\,] = [\,0 \;\; \cdots \;\; 0 \;\; 1\,] = d^T. \tag{4}
\]

There exists a nonsingular transformation $z = Tx$ taking (1a) to the companion system 
(2) with $z_1 = y = c^T x$, if and only if 

\[
\operatorname{rank} \begin{bmatrix} c^T \\ c^T A \\ \vdots \\ c^T A^{n-1} \end{bmatrix} = n
\]

(system (1a), (1b) is completely observable) and 

\[
c^T\,[\,b \;\; Ab \;\; \cdots \;\; A^{n-1}b\,] = [\,0 \;\; \cdots \;\; 0 \;\; 1\,] = d^T.
\]

When this is the case, $T$ is unique and is the matrix (3) with $\tau = c^T$. 


Our main goal in Part II is to generalize Theorem 1 for the simplest form of 
single input nonlinear systems. The exposition uses ideas from the theory of 
differential equations, linear algebra, and analysis, and basic concepts in differen- 
tial geometry. The equivalence problem we consider is one of the fundamental 
results of geometric nonlinear control theory. The original formulation and solu- 
tion of this equivalence problem in the single-input case is due to R. W. Brockett 
[1]. 

We begin with a calculation that recapitulates the essential calculations that 
established Theorem 1. It is also useful in the generalization of Theorem 1. 
Consider the following product of n X n matrices, 


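\[
\begin{bmatrix} c^T \\ c^T A \\ \vdots \\ c^T A^{n-1} \end{bmatrix} [\,b \;\; Ab \;\; \cdots \;\; A^{n-1}b\,] = \begin{bmatrix} 0 & \cdots & 0 & c^T A^{n-1} b \\ 0 & \cdots & c^T A^{n-1} b & * \\ \vdots & & & \vdots \\ c^T A^{n-1} b & * & \cdots & * \end{bmatrix}, \tag{5}
\]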


which results when $c$, $A$, $b$ satisfy $c^T A^k b = 0$ for $0 \le k \le n - 2$. If $c^T A^{n-1} b \ne 0$ as 
well, then both matrices on the left must be nonsingular. In Part I we showed that, 
conversely, if one of the matrices on the left is nonsingular (for some b, respectively 
c), then the other matrix is also nonsingular (for some c, respectively b). 
Nonsingularity on the right in (5) implies an observability condition and a controllability 
condition [12]. The geometric interpretation of the zeros on the right hand 
side is that the null space of the linear functional $y = c^T x$ is the $(n - 1)$-dimensional 
space, $\operatorname{span}\{b, Ab, \ldots, A^{n-2}b\}$. The duality aspects of Part I arise 
from the duality between vectors and linear functionals in a finite dimensional 
vector space. Similar duality considerations in Part II involve the pairing of vector 
fields and co-vector fields (or differential 1-forms). 
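
The anti-triangular structure on the right of (5) can be checked numerically on a small case. The sketch below is only an illustration: the 3 × 3 companion matrix A, the vectors b and c, and the helper names are choices made here, not data from Part I. They are picked so that c^T b = c^T A b = 0 and c^T A^2 b = 1.

```python
def mat_vec(M, v):
    """Multiply a square matrix (list of rows) by a column vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def vec_mat(v, M):
    """Multiply a row vector by a square matrix (list of rows)."""
    n = len(M)
    return [sum(v[i] * M[i][j] for i in range(n)) for j in range(n)]


def dot(u, v):
    """Inner product of two vectors given as lists."""
    return sum(a * b for a, b in zip(u, v))


# Illustrative data (an assumption of this sketch, not from the article).
A = [[0, 1, 0],
     [0, 0, 1],
     [-6, -11, -6]]
b = [0, 0, 1]
c = [1, 0, 0]
n = len(A)

# Rows c^T, c^T A, ..., c^T A^(n-1) of the left factor in (5).
obs_rows = [c]
for _ in range(n - 1):
    obs_rows.append(vec_mat(obs_rows[-1], A))

# Columns b, Ab, ..., A^(n-1) b of the right factor in (5).
ctrl_cols = [b]
for _ in range(n - 1):
    ctrl_cols.append(mat_vec(A, ctrl_cols[-1]))

# With 0-based indices, entry (i, j) of the product equals c^T A^(i+j) b.
product = [[dot(row, col) for col in ctrl_cols] for row in obs_rows]
for row in product:
    print(row)
```

For this data the product is zero above the anti-diagonal and carries c^T A^2 b = 1 along it, so both factors are nonsingular, as the discussion of (5) predicts.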

If we replace the linear vector field Ax in (1) by f(x), and the vector b by a 
vector field g(x), we are interested in determining exactly when the resulting 
single input nonlinear system x’ = f(x) + g(x)u may be transformed to the special 
form $y^{(n)} = v$ by local coordinate change and state feedback. If such a transformation 
is possible, one can exploit the special control-theoretic properties discussed in [12] 
for that special form. In such a case, a nonlinear system can be controlled using 
linear control methods. 


Example 1. Suppose $x \in \mathbb{R}^3$ and $u \in \mathbb{R}$. Is the system 


exp( —X2) | 
x’ =f(x) +g(x)u= xy +\Oly 
sx? 0 


equivalent to $y^{(3)} = v$? What is this equivalence, and how do we determine if such 
an equivalence is possible? 


2. LINEARIZATION AND FUNCTIONS OF RELATIVE DEGREE n. Consider 
the single-input single-output nonlinear version of (1a) described by 

\[
x' = f(x) + g(x)u, \tag{6a}
\]

where $f$ and $g$ are smooth vector fields defined in some open region of $\mathbb{R}^n$. The 
term smooth means that the components of $f$ and $g$ are continuously differentiable 
as often as required in our discussion. Equation (6a) may be augmented with 
an output, 

\[
y = h(x), \tag{6b}
\]

where $h$ is a smooth real-valued function defined on the same region. We are interested in 
necessary and sufficient conditions under which system (6a) is equivalent to a 
linear companion system. Such an equivalence, if possible, generally requires 
feedback in addition to coordinate change: our examples make it clear that (4), the 
condition that allows for a coordinate transformation directly to companion form 
in the linear case, cannot generally be expected here. 


2.1 Input-Output Linearization. As in the linear case, we can try to use a known 
output $h$ to help define a local coordinate change, $z = T(x)$, and state feedback, 
$u = \alpha(x) + \beta(x)v$ (with $\beta(x) \ne 0$), in a neighborhood $U$ of $x_0$ in (6), to produce a 
linear input-output equation, 

\[
y^{(j)} = v. \tag{7}
\]

In (7), $v$ is called the new reference input, and we would like to have $j = n$ in order 
to capture the full n-dimensional dynamical system. This is the Input-Output 
Linearization Problem (IOLP). The idea is that feedback by $u = \alpha(x)$ simplifies 
the system equations by cancelling nonlinearities when possible, and then subsequent 
controls $v$ are available to control the dynamics in a desired manner. 

The next definition recalls the behavior of the special outputs that yield 
observability and a companion form in the linear case of [12]. The definition is 
motivated by the form of the terms that appear when a function is differentiated 
repeatedly along system (6a). 


Definition 1. The Lie derivative of a real-valued function $h$ along a vector field $g$ 
is the real-valued function $L_g h$ defined by $L_g h(x) = dh(x) \cdot g(x)$, where $dh(x)$ is 
the row gradient of $h$ at $x$. For iterated derivatives of this type we write $L_g^0 h = h$, 
and $L_g^k h = L_g(L_g^{k-1} h)$. The function $h$ has relative degree $j \ge 1$ with respect to 
(6a) at the point $x_0$ if 


(i) $L_g L_f^k h(x) = 0$ for all $x$ in a neighborhood of $x_0$, for all $0 \le k < j - 1$, 
and 
(ii) $L_g L_f^{j-1} h(x_0) \ne 0$. 


The relative degree is the number of differentiations of $h$ along the system that 
are needed to make $u$ appear explicitly. Thus, for relative degree $j = 1$, note that 
only condition (ii) is relevant. There may be points $x_0$ where a relative degree for a 
function is not defined. By continuity, conditions (i) and (ii) allow us to speak of a 
function $h$ having relative degree $j$ in an open set $U$ containing $x_0$. 
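
To see why the relative degree counts differentiations, it may help to write out the first two derivatives of $y = h(x)$ along (6a), using the chain rule and Definition 1. This is a routine expansion, included only as an illustration:

```latex
\begin{align*}
y'  &= dh(x)\cdot\bigl(f(x) + g(x)u\bigr) = L_f h(x) + L_g h(x)\,u, \\
y'' &= L_f^2 h(x) + L_g L_f h(x)\,u
      \qquad \text{if } L_g h \equiv 0 \text{ near } x_0 .
\end{align*}
```

If $L_g h(x_0) \ne 0$, the input already appears in $y'$ and the relative degree is 1. If $L_g h \equiv 0$ near $x_0$ but $L_g L_f h(x_0) \ne 0$, the input first appears in $y''$ and the relative degree is 2. Higher relative degrees follow the same pattern.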


Example 2. Consider the system 
\[
x_1' = \sin x_2, \tag{8}
\]
\[
x_2' = -x_1^2 + u. \tag{9}
\]

The function $y = h(x) = x_2$ has relative degree 1 at all points since $L_g h(x) = 1$. 
On the other hand, $y = h(x) = x_1$ satisfies $L_g h = 0$, and $L_g L_f h(x) = \frac{\partial}{\partial x_2}(\sin x_2) 
= \cos x_2$, so this function has relative degree 2 at the equilibrium $x_0 = 0$ of the 
unforced system. 
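
The two relative degrees in Example 2 can also be checked numerically. The following is a minimal sketch, assuming the system is $x_1' = \sin x_2$, $x_2' = -x_1^2 + u$. The function names, the finite-difference step, and the tolerance are choices made here, and the test is pointwise only.

```python
import math


def f(x):
    """Drift vector field of Example 2 (the unforced dynamics)."""
    return [math.sin(x[1]), -x[0] ** 2]


def g(x):
    """Input vector field of Example 2."""
    return [0.0, 1.0]


def lie_derivative(h, field, eps=1e-5):
    """Return x -> dh(x) . field(x), with dh approximated by central differences."""
    def Lh(x):
        v = field(x)
        total = 0.0
        for i in range(len(x)):
            xp = list(x)
            xm = list(x)
            xp[i] += eps
            xm[i] -= eps
            total += (h(xp) - h(xm)) / (2 * eps) * v[i]
        return total
    return Lh


def relative_degree_at(h, x0, max_order=4, tol=1e-4):
    """Smallest j with L_g L_f^(j-1) h(x0) != 0, tested at the single point x0.

    Caveat: Definition 1 also requires the lower-order terms to vanish in a
    whole neighborhood of x0; this pointwise numerical test does not check that.
    """
    current = h  # holds L_f^(j-1) h
    for j in range(1, max_order + 1):
        if abs(lie_derivative(current, g)(x0)) > tol:
            return j
        current = lie_derivative(current, f)
    return None


x0 = [0.0, 0.0]
print(relative_degree_at(lambda x: x[1], x0))  # output h = x_2
print(relative_degree_at(lambda x: x[0], x0))  # output h = x_1
```

At the origin this reports relative degree 1 for $h = x_2$ and 2 for $h = x_1$, in agreement with the hand computation.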


814 FUNDAMENTAL CONTROL THEORY II [Monthly 106 


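Functions having relative degree $n$ are especially useful. Theorem 1 illustrates 
why linear functionals of relative degree $n$ were central in [12]. 

Suppose an output $y = h(x)$ has relative degree $n$ at $x_0$. Then $y$ and the first 
$n$ time derivatives of $y$ along (6a) yield the equations 

\[
\begin{bmatrix} y \\ y' \\ \vdots \\ y^{(n)} \end{bmatrix} = \begin{bmatrix} h(x) \\ L_f h(x) \\ \vdots \\ L_f^n h(x) \end{bmatrix} + \begin{bmatrix} 0 \\ 0 \\ \vdots \\ L_g L_f^{n-1} h(x) \end{bmatrix} u. \tag{10}
\]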


In Proposition 2 we show that if $h$ has relative degree $n$ at $x_0$, then the vector 
function $[\,h \;\; L_f h \;\; \cdots \;\; L_f^{n-1} h\,]^T$ has nonsingular Jacobian with respect to $x$ at $x_0$; 
thus, the functions $L_f^k h(x)$ for $0 \le k \le n - 1$ can be used as component functions 
in a nonlinear coordinate transformation defined locally around $x_0$. The presence 
of the nonzero function $L_g L_f^{n-1} h(x)$ as a coefficient of $u$ prevents us from 
obtaining the linear, controllable companion form for $z = [\,y, y', \ldots, y^{(n-1)}\,]^T$ by 
coordinate transformation alone. However, the coordinate transformation together 
with state feedback does produce the simple form, $y^{(n)} = v$. We need only define 

\[
u = \frac{1}{L_g L_f^{n-1} h(x)}\bigl(-L_f^n h(x) + v\bigr), \tag{11}
\]

where $v$ is the new reference input. This situation allows for control action on the 
nonlinear system by operating with the linear controllable form (7) (with $j = n$). 
Note that an assumption of relative degree $j < n$ leads to a linear input-output 
relation (7). Relative degree $n$ of $h$ at $x_0$ is a type of local observability condition: 
with $u = 0$, the Inverse Function Theorem [9, p. 193] applied to (10) implies that $x$ 
is determined by $[\,y, y', \ldots, y^{(n-1)}\,]$ near $x_0$. 
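
As an illustration of (11), the sketch below applies the linearizing feedback to the system of Example 2 with the relative degree 2 output $h(x) = x_1$, again assuming that system is $x_1' = \sin x_2$, $x_2' = -x_1^2 + u$. The linear law $v = -y - 2y'$, the initial state, the step size, and all function names are hypothetical choices made here. The closed loop should then behave like the linear equation $y'' = v$.

```python
import math


def feedback(x, v):
    """Linearizing feedback (11) for Example 2 with output h(x) = x_1.

    Here L_f h = sin(x_2), L_f^2 h = -x_1^2 cos(x_2), L_g L_f h = cos(x_2),
    so (11) reads u = (x_1^2 cos(x_2) + v) / cos(x_2), valid while cos(x_2) != 0.
    """
    return (x[0] ** 2 * math.cos(x[1]) + v) / math.cos(x[1])


def closed_loop(x):
    """Right-hand side of (8)-(9) under the feedback above."""
    y, dy = x[0], math.sin(x[1])   # y = h(x), y' = L_f h(x)
    v = -y - 2.0 * dy              # hypothetical linear control law for y'' = v
    u = feedback(x, v)
    return [math.sin(x[1]), -x[0] ** 2 + u]


def rk4_step(rhs, x, dt):
    """One classical Runge-Kutta step for x' = rhs(x)."""
    k1 = rhs(x)
    k2 = rhs([xi + 0.5 * dt * ki for xi, ki in zip(x, k1)])
    k3 = rhs([xi + 0.5 * dt * ki for xi, ki in zip(x, k2)])
    k4 = rhs([xi + dt * ki for xi, ki in zip(x, k3)])
    return [xi + dt / 6.0 * (a + 2 * b + 2 * c + d)
            for xi, a, b, c, d in zip(x, k1, k2, k3, k4)]


def simulate(x_start, t_final=10.0, dt=0.01):
    """Integrate the closed loop from x_start up to time t_final."""
    x = list(x_start)
    for _ in range(round(t_final / dt)):
        x = rk4_step(closed_loop, x, dt)
    return x


x_start = [0.5, 0.3]
x_end = simulate(x_start)

# Closed-form solution of y'' = -y - 2y': y(t) = (y0 + (y0 + y0') t) exp(-t).
y0, dy0, t = x_start[0], math.sin(x_start[1]), 10.0
y_exact = (y0 + (y0 + dy0) * t) * math.exp(-t)
print(abs(x_end[0] - y_exact))
```

The printed discrepancy between the simulated nonlinear state $x_1$ and the solution of the linear equation should be at the level of the integration error. The feedback is only defined while $\cos x_2 \ne 0$, which holds along this particular trajectory.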


2.2 Input-State Linearization. If a relative degree $n$ function is not readily available 
as an output function, we still ask for a local coordinate change, $z = T(x)$, 
and feedback $u = \alpha(x) + \beta(x)v$ with $\beta(x) \ne 0$ near $x_0$, that produces a controllable 
linear system for $z$. This is often called the Input-State Linearization Problem 
(ISLP). As shown in [12], given a controllable linear system, an additional coordinate 
change and state feedback may be used to produce the linear system 

\[
z' = Nz + dv = \begin{bmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & 1 \\ 0 & 0 & 0 & \cdots & 0 \end{bmatrix} z + \begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \\ 1 \end{bmatrix} v, \tag{12}
\]

where $N$ is the standard nilpotent block with ones on the superdiagonal and zeros 
elsewhere, and $d = [\,0 \;\; \cdots \;\; 0 \;\; 1\,]^T$ as usual. Thus we take the ISLP to mean the 
problem of achieving the form (12). 

Our first goal is to show that the ISLP is solvable if and only if there exists a 
function of relative degree $n$ at $x_0$ for (6a). We have already discussed the 
sufficiency of the relative degree $n$ condition, though the proof of Proposition 2 is 
needed to complete that discussion. After the next example, we show that solvability 
of the ISLP implies the existence of a function $h(x)$ of relative degree $n$ at $x_0$, 
namely, the first coordinate function $T_1$ of the transformation $T$. Our ultimate 
goal is to derive geometric conditions for the existence of a relative 
degree $n$ function. 


November 1999] FUNDAMENTAL CONTROL THEORY II 815 


One can adopt the point of view that there is no essential mathematical 
difference between the linearization problems IOLP and ISLP when a relative 
degree n function exists—the difference lies in whether there is an explicitly known 
output function of relative degree n, or if such a function must be constructed. 
From the practical point of view, of course, this is an essential difference. We 
return to Example 2 to illustrate an important consideration. 


Example 3. Consider again the system, 
\[
x_1' = \sin x_2,
\]
\[
x_2' = -x_1^2 + u.
\]

It is easy to linearize the input-output behavior; take $y = x_2$, and set $u = x_1^2 + v$, 
which produces the linear input-output relation $y' = v$. This relation would make 
it easy to track a specified output $y(t) = x_2(t)$ by control $v$, but the linearizing 
control $u$ would make $x_1$ unobservable if it is only the resulting input-output 
relation $y' = v$ that is considered. Thus, some of the original two-dimensional 
dynamics is hidden by the input-output relation. The presence of such unobservable 
dynamics introduces the question of internal stability for that dynamics; the $x_1$ 
variable may not remain well-behaved (e.g., bounded) when using a control 
strategy based on the relation $y' = v$. For example, suppose we wanted to hold the 
output $y = x_2$ via feedback at a constant value, $x_2 = c$. Then the $x_1$ solution 
would be $x_1(t) = x_1(0) + t \sin c$, and therefore $|x_1(t)| \to \infty$ as $t \to \infty$ whenever $\sin c \ne 0$. 
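
The unbounded hidden state in Example 3 is easy to reproduce in simulation. The sketch below assumes the system $x_1' = \sin x_2$, $x_2' = -x_1^2 + u$ with the feedback $u = x_1^2 + v$. The value $c = 1$, the time horizon, the step size, and the function name are illustrative choices made here.

```python
import math


def simulate_hidden_state(x1_start, c, t_final=20.0, dt=0.01):
    """Euler simulation of Example 3 with u = x_1^2 + v and v = 0.

    The output x_2 starts at c and is held there (x_2' = v = 0),
    while the hidden state obeys x_1' = sin(x_2).
    """
    x1, x2 = x1_start, c
    for _ in range(round(t_final / dt)):
        v = 0.0                 # keeps the output y = x_2 constant
        u = x1 ** 2 + v         # linearizing feedback from Example 3
        dx1 = math.sin(x2)
        dx2 = -x1 ** 2 + u      # equals v
        x1, x2 = x1 + dt * dx1, x2 + dt * dx2
    return x1, x2


c = 1.0
x1_end, x2_end = simulate_hidden_state(x1_start=0.0, c=c)
print(x1_end, x2_end)
print(20.0 * math.sin(c))  # x_1(0) + t sin(c) at t = 20
```

The output $x_2$ stays at $c$ while $x_1$ grows linearly at rate $\sin c$, matching the formula $x_1(t) = x_1(0) + t \sin c$. Nothing in the relation $y' = v$ reveals this drift.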


We now show that a solution of the ISLP entails a relative degree n function. 
Suppose the ISLP is solved, so that the transformation $z = T(x)$, combined with
feedback $u = \alpha(x) + \beta(x)v$, produces the linear system (12). From the definitions
of the variables, this occurs if and only if

\[
\frac{\partial T}{\partial x}(x)\,\bigl(f(x) + g(x)u\bigr) = NT(x) + dv, \tag{13}
\]

where equality holds for all $u = \alpha(x) + \beta(x)v$, with v arbitrary. By considering
$u = 0$ and $u = 1$, we see that (13) is equivalent to the two partial differential
equations:

\[
\frac{\partial T}{\partial x}(x)\,f(x) = NT(x) - d\,\alpha(x)\beta^{-1}(x), \tag{14a}
\]
\[
\frac{\partial T}{\partial x}(x)\,g(x) = d\,\beta^{-1}(x). \tag{14b}
\]

If $T(x) = [T_1(x)\ T_2(x)\ \ldots\ T_n(x)]^T$, then
\[
NT(x) - d\,\alpha(x)\beta^{-1}(x) = [T_2(x)\ T_3(x)\ \ldots\ T_n(x)\ \ {-\alpha(x)\beta^{-1}(x)}]^T,
\]
and
\[
d\,\beta^{-1}(x) = [0\ 0\ \ldots\ 0\ \beta^{-1}(x)]^T.
\]
This allows us to display (14a) in detail:
\[
dT_k(x)\cdot f(x) = T_{k+1}(x), \quad k = 1,\ldots,n-1, \tag{15a}
\]
and
\[
dT_n(x)\cdot f(x) = -\alpha(x)\beta^{-1}(x). \tag{15b}
\]
Therefore all components of T are determined by the first component $T_1$. And the
last equation specifies the ratio, $-\alpha(x)\beta^{-1}(x)$, in terms of $T_1$. Similarly we display
(14b) using (15a) and Definition 1:


\[
dT_{k+1}(x)\cdot g(x) = L_g L_f^{k} T_1(x) = 0, \quad k = 0,\ldots,n-2, \tag{16a}
\]
while
\[
dT_n(x)\cdot g(x) = L_g L_f^{n-1} T_1(x) = \beta^{-1}(x) \neq 0. \tag{16b}
\]

Thus, by (15a) and (16), the solution of the ISLP requires a function $T_1(x)$ that has
relative degree n. Given $T_1$, the rest of T is then defined by (15a), and the required
feedback $u = \alpha(x) + \beta(x)v$ is determined by

\[
\alpha(x) = -\frac{L_f^{n} T_1(x)}{L_g L_f^{n-1} T_1(x)}, \qquad
\beta(x) = \frac{1}{L_g L_f^{n-1} T_1(x)}. \tag{17}
\]

Notice that an equilibrium point of interest, say $x_0$ such that $f(x_0) = 0$, can always
be transformed to $z = T(x_0) = 0$: by (15a), we need only require that $T_1(x_0) = 0$.


We summarize the preceding discussion as follows. 


Theorem 2. System (6a) can be transformed by coordinate transformation $z = T(x)$
and state feedback $u = \alpha(x) + \beta(x)v$ in a neighborhood of $x_0$ to the linear controllable
form $z' = Nz + dv$ if and only if there exists a function $\lambda$ having relative degree n
with respect to (6a) at the point $x_0$. When this is the case, T is determined by (15a) by
defining the first component function by $T_1 = \lambda$, and the feedback is determined
by (17).


Proof: Everything has been proved except the legitimacy of the coordinate change
\[
z = T(x) = [\lambda(x),\ L_f\lambda(x),\ \ldots,\ L_f^{n-1}\lambda(x)]^T
\]
when $\lambda$ has relative degree n at $x_0$. This argument appears as part of a more
general statement in Proposition 2. Assuming this result, Theorem 2 follows. ∎


Let us illustrate these ideas, and the general nonuniqueness of T in the 
nonlinear case: 


Example 4. Consider the system in Example 3 near the equilibrium $x_0 = 0$ of the
unforced system (with $u = 0$). The equations (16) for $T_1$ are

\[
L_g T_1(x) = 0, \qquad L_g T_2 \neq 0; \qquad T_2 = dT_1(x)\cdot f(x).
\]
Using $g = [0\ 1]^T$ we see that $T_1$ is independent of $x_2$, so $T_2(x) = (\partial T_1/\partial x_1)\sin x_2$.
The nontriviality condition becomes
\[
L_g T_2 = \frac{\partial T_2}{\partial x_2} = \frac{\partial T_1}{\partial x_1}\cos x_2 \neq 0,
\]


which holds as long as $\cos x_2 \neq 0$ and $\partial T_1/\partial x_1 \neq 0$. One solution is $T_1(x) = x_1$,
which we used in Example 3. The choice is not unique: $T_1(x) = x_1 - x_1^2$ also works
near $x_0 = 0$ in fulfilling the conditions of Theorem 2.
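The feedback formulas (17) can be checked on this two-dimensional example. The following is a minimal sketch, assuming the system $x_1' = \sin x_2$, $x_2' = -x_1^2 + u$ and the choice $T_1 = x_1$, so that $T_2 = \sin x_2$, $\alpha(x) = x_1^2$ and $\beta(x) = 1/\cos x_2$. The linear design $v = -z_1 - 2z_2$, the step size, and the function names are illustrative assumptions.

```python
import math

def feedback(x1, x2, v):
    """u = alpha(x) + beta(x) v from (17) with T1 = x1 (requires cos x2 != 0)."""
    alpha = x1 ** 2
    beta = 1.0 / math.cos(x2)
    return alpha + beta * v

def rhs(x1, x2, u):
    """Right-hand side of the assumed system x1' = sin x2, x2' = -x1**2 + u."""
    return math.sin(x2), -x1 ** 2 + u

def simulate(x1, x2, t_end=20.0, dt=1e-3):
    """Euler integration of the closed loop with an assumed stabilizing v."""
    for _ in range(int(round(t_end / dt))):
        z1, z2 = x1, math.sin(x2)          # z = T(x)
        v = -z1 - 2.0 * z2                 # places both poles of z1'' = v at -1
        dx1, dx2 = rhs(x1, x2, feedback(x1, x2, v))
        x1, x2 = x1 + dt * dx1, x2 + dt * dx2
    return x1, x2

print(simulate(0.5, 0.3))
```

In the z coordinates the closed loop is the double integrator $z_1' = z_2$, $z_2' = v$, so any linear design for v carries over to the nonlinear system as long as $\cos x_2 \neq 0$ along the trajectory.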


Example 5. Using Theorem 2 we can show that Example 1 is not input-state
linearizable. The vector field g in Example 1 is $g = [1\ 0\ 0]^T$. Let $\lambda$ be a smooth
function, and suppose $L_g\lambda(x) = (\partial\lambda/\partial x_1)(x) = 0$ for x in an open set U; then $\lambda$
is independent of $x_1$ in U, so $\lambda = \lambda(x_2, x_3)$ there. If we also impose the condition
that $L_g L_f\lambda(x) = 0$ in U, then we have


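\[
L_g L_f\lambda(x) = \frac{\partial}{\partial x_1}\Bigl( x_1\,\frac{\partial\lambda}{\partial x_2} + \tfrac{1}{2}x_1^2\,\frac{\partial\lambda}{\partial x_3} \Bigr)
= \frac{\partial\lambda}{\partial x_2} + x_1\,\frac{\partial\lambda}{\partial x_3} = 0,
\]

and it is not possible for this to hold in U when $\lambda = \lambda(x_2, x_3)$ is nonconstant. Thus, there is no
function having relative degree 3 with respect to this system in any open set in $R^3$.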


3. NECESSARY CONDITIONS FOR ISLP SOLVABILITY. We’ve seen that an 
“artificial output” T, of relative degree n is determined by a nontrivial solution of 
the partial differential equation system (16a), the nontriviality condition being 
(16b). When is this first order partial differential equation system solvable? We 
want computable conditions for solvability directly in terms of f and g. Let us first 
consider necessary conditions, and for that, a nonlinear version of the calculation 
in (5) is helpful, with an appropriate generalization of the columns b, Ab,... in 
(5). This analysis of necessary conditions completes the proof of Theorem 2 and
leads to the computable criteria we seek. We should remark that there are indeed 
local observability and controllability conditions involved in what follows, as you 
might expect. After all, we are transforming to a linear system with the properties 
of controllability and observability. We discuss a local reachability property after 
the main Theorem 3—this property is related to, but weaker than, complete 
controllability. For now, we continue to focus on the goal of generalizing the 
transformation to companion form statements of Theorem 1, but in the process 
we indicate the intuition involved in generalizations of the linear controllability 
criterion. 

Let us consider (5): we replace the first matrix on the left by
$[d\lambda\ \ dL_f\lambda\ \ldots\ dL_f^{n-1}\lambda]^T$, where $\lambda$ has relative degree n, and we put $g(x)$ in place
of b in the second matrix on the left; we then need vector fields to replace
$Ab, \ldots, A^{n-1}b$. That is, we need to identify the null space of the differential $d\lambda$.
The appropriate vector fields can be motivated either analytically or algebraically, 
and we consider both aspects in order to build some intuition. 

The next definition can be motivated by the calculation of Proposition 1, but it 
is convenient to state it here and follow it with an important example. 


Definition 2. The Lie bracket $[g_1, g_2]$ of two vector fields $g_1$, $g_2$ is the vector field
defined by

\[
[g_1, g_2](x) = \frac{\partial g_2}{\partial x}(x)\,g_1(x) - \frac{\partial g_1}{\partial x}(x)\,g_2(x). \tag{18}
\]

Here is a notation that helps with iterated brackets: define $\mathrm{ad}_{g_1}^0 g_2 = g_2$, $\mathrm{ad}_{g_1} g_2 = [g_1, g_2]$,
and $\mathrm{ad}_{g_1}^k g_2 = [g_1, \mathrm{ad}_{g_1}^{k-1} g_2]$ for $k \ge 1$. The brackets described in (18) are
important in the linear system case:


Example 6. If $f(x) = Ax$ and $g(x) = b$, then $[f, g](x) = -Ab$. Also, $\mathrm{ad}_f^2 g(x) = [f, [f, g]](x) = A^2 b$,
and in general, $\mathrm{ad}_f^k g(x) = (-1)^k A^k b$.
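The bracket formula (18) and the linear case can be checked numerically. The following is a minimal sketch that approximates Jacobians by central differences; the matrix A, the vector b, the step size h, and the helper names `jacobian`, `matvec`, and `bracket` are illustrative assumptions rather than anything from the article.

```python
def jacobian(F, x, h=1e-4):
    """Central-difference Jacobian of the vector field F at the point x."""
    n = len(x)
    J = [[0.0] * n for _ in range(n)]
    for j in range(n):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        Fp, Fm = F(xp), F(xm)
        for i in range(n):
            J[i][j] = (Fp[i] - Fm[i]) / (2.0 * h)
    return J

def matvec(M, v):
    """Matrix-vector product for nested lists."""
    return [sum(mij * vj for mij, vj in zip(row, v)) for row in M]

def bracket(g1, g2):
    """Return the field [g1, g2] = (dg2/dx) g1 - (dg1/dx) g2, as in (18)."""
    def field(x):
        a = matvec(jacobian(g2, x), g1(x))
        b = matvec(jacobian(g1, x), g2(x))
        return [ai - bi for ai, bi in zip(a, b)]
    return field

A = [[0.0, 1.0], [-2.0, -3.0]]   # illustrative matrix
b = [0.0, 1.0]                   # illustrative input vector

f = lambda x: matvec(A, x)
g = lambda x: list(b)

ad1 = bracket(f, g)      # should approximate -A b
ad2 = bracket(f, ad1)    # should approximate A^2 b
print(ad1([0.3, -0.7]), ad2([0.3, -0.7]))
```

For this A and b one has $-Ab = [-1,\ 3]^T$ and $A^2 b = [-3,\ 7]^T$, independent of the evaluation point, which is what the two printed vectors approximate.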


Example 6 suggests that the brackets $\mathrm{ad}_f^k g(x)$ are important in the nonlinear
case. To confirm this, we consider an analytic relation involving f and g. Write
$\phi_t^\eta(x)$ for the time t solution map of the differential equation $x' = \eta(x)$; that is,
$\phi_0^\eta(x) = x$ and $d\phi_t^\eta(x)/dt = \eta(\phi_t^\eta(x))$. Then $(\phi_t^\eta)^{-1} = \phi_{-t}^\eta$, where defined. Also
write $(\phi_t^\eta)_*$ for the derivative map with Jacobian matrix $\partial\phi_t^\eta/\partial x$. The variational
differential equation satisfied by $(\phi_t^\eta)_*$ is the equation

\[
\frac{d}{dt}(\phi_t^\eta)_* = \frac{\partial\eta}{\partial x}\bigl(\phi_t^\eta(x)\bigr)\,(\phi_t^\eta)_*;
\]

to prove this relation, one differentiates the identity $\frac{d}{dt}\phi_t^\eta(x) = \eta(\phi_t^\eta(x))$ with
respect to x and then interchanges the order of differentiations with respect to
x and t. It is also useful to have the variational equation for $(\phi_{-t}^\eta)_* = ((\phi_t^\eta)_*)^{-1}$:
the formula

\[
\frac{d}{dt}\bigl(A^{-1}\bigr) = -A^{-1}\,\frac{dA}{dt}\,A^{-1}
\]

for the derivative of the inverse of a nonsingular matrix function of t, applied to
$A = (\phi_t^\eta)_*$, gives

\[
\frac{d}{dt}(\phi_{-t}^\eta)_* = -(\phi_{-t}^\eta)_*\,\frac{\partial\eta}{\partial x}\bigl(\phi_t^\eta(x)\bigr).
\]

Now consider following the flow of f (the vector field in (1a)) for a short time t,
to the point $\phi_t^f(x_0)$; then determine the direction vector $(f + g)(\phi_t^f(x_0))$, the
direction you would move if you turned the control u “on” with $u = 1$; and, finally,
transfer this tangent vector back to the point $x_0$ by applying $(\phi_{-t}^f)_*$. Thus,
consider the “curve of tangents” based at $x_0$,

\[
V(t) = (\phi_{-t}^f)_*\,\eta\bigl(\phi_t^f(x_0)\bigr), \tag{19}
\]
where $\eta = f + g$, and more specifically, the derivative $V'(0)$ at $t = 0$. By considering
shorter and shorter times t, it is plausible that the vector $V'(0)$ (and also the
higher order derivatives of V at $t = 0$) should indicate something about the
directions we might move, starting at $x_0$, by some suitable “off-on” switching
strategy for the input u. This can be made precise, in a way that provides an
alternative motivation for the Lie bracket operation; see [8, Proposition 3.6,
pp. 77-78] or [7, pp. 323-324]. We have motivated (19) here because it is useful in
the proof of the main Theorem 3. Notice that the construction in (19) is valid for
any vector field $\eta$, although $\eta = f + g$ is the immediate interest.


Proposition 1. The tangent vector $V'(0)$ of the curve (19) (where $\eta = f + g$) is
\[
V'(0) = [f, \eta](x_0) = [f, g](x_0).
\]
If f and g are analytic, then $V^{(k)}(0) = \mathrm{ad}_f^k g(x_0)$ for all $k \ge 1$.


Proof: Use the formula for the derivative of a product, together with the variational
equations, to compute

\[
\begin{aligned}
V'(t) &= \Bigl(\frac{d}{dt}(\phi_{-t}^f)_*\Bigr)\,\eta\bigl(\phi_t^f(x_0)\bigr)
      + (\phi_{-t}^f)_*\,\frac{\partial\eta}{\partial x}\bigl(\phi_t^f(x_0)\bigr)\,\frac{d}{dt}\phi_t^f(x_0) \\
      &= -(\phi_{-t}^f)_*\,\frac{\partial f}{\partial x}\,\eta\bigl(\phi_t^f(x_0)\bigr)
      + (\phi_{-t}^f)_*\,\frac{\partial\eta}{\partial x}\,f\bigl(\phi_t^f(x_0)\bigr) \\
      &= (\phi_{-t}^f)_*\,[f, \eta]\bigl(\phi_t^f(x_0)\bigr).
\end{aligned} \tag{20}
\]

Set $t = 0$ to get $V'(0) = [f, \eta](x_0)$, taking into account the initial conditions
$\phi_0^f(x) = x$ and $(\phi_0^f)_* = I$. Since $[f, f + g] = [f, g]$, the first statement is proved.
Given (20), the second statement follows by induction. ∎

Because of Proposition 1, the Lie bracket $[f, \eta]$ is also called the Lie derivative
of $\eta$ along f.

The algebraic property that shows the connection between the Lie bracket
operation on vector fields and the Lie derivative operation on functions is the
Jacobi identity; it says that if v, w are vector fields and $\lambda$ is a smooth function, then
$(L_{[v,w]}\lambda)(x) = (L_v L_w - L_w L_v)\lambda(x)$. This identity is proved as follows. For any
smooth $\lambda$, and any x,

\[
\begin{aligned}
L_v L_w\lambda(x) - L_w L_v\lambda(x)
 &= L_v\bigl(d\lambda(x)\cdot w(x)\bigr) - L_w\bigl(d\lambda(x)\cdot v(x)\bigr) \\
 &= \Bigl[w(x)^T d^2\lambda(x) + d\lambda(x)\,\frac{\partial w}{\partial x}(x)\Bigr]v(x)
  - \Bigl[v(x)^T d^2\lambda(x) + d\lambda(x)\,\frac{\partial v}{\partial x}(x)\Bigr]w(x),
\end{aligned} \tag{21}
\]
where $d^2\lambda(x)$ is the matrix of second partial derivatives of $\lambda$ at x. Since $d^2\lambda$ is
symmetric, we get

\[
L_v L_w\lambda - L_w L_v\lambda
 = d\lambda\cdot\Bigl(\frac{\partial w}{\partial x}\,v - \frac{\partial v}{\partial x}\,w\Bigr)
 = d\lambda\cdot[v, w] = L_{[v,w]}\lambda.
\]

The Jacobi identity itself helps to identify $\ker d\lambda(x)$ for $x \in U$, for a function $\lambda$
having relative degree n. Given g as the replacement for b in (5), and assuming
that $d\lambda(x)\cdot g(x) = 0$, one can show that $\mathrm{ad}_f^k g(x) \in \ker d\lambda(x)$ for $k = 1,\ldots,n-2$
by induction, using the Jacobi identity. We now give the details of the nonlinear
version of the calculation in (5).


Proposition 2. If $\lambda$ has relative degree n with respect to (6a) in the open set U, then
for all $x \in U$,

(1) the covectors $d\lambda(x), dL_f\lambda(x), \ldots, dL_f^{n-1}\lambda(x)$ are linearly independent;

(2) the vectors $g(x), \mathrm{ad}_f g(x), \ldots, \mathrm{ad}_f^{n-1} g(x)$ are linearly independent.


Proof: Consider the matrix product that generalizes (5):

\[
\begin{pmatrix} d\lambda(x) \\ \vdots \\ dL_f^{n-1}\lambda(x) \end{pmatrix}
\begin{pmatrix} g(x) & \mathrm{ad}_f g(x) & \cdots & \mathrm{ad}_f^{n-1} g(x) \end{pmatrix}
=
\begin{pmatrix}
L_g\lambda(x) & L_{\mathrm{ad}_f g}\lambda(x) & \cdots & L_{\mathrm{ad}_f^{n-1} g}\lambda(x) \\
L_g L_f\lambda(x) & \cdots & L_{\mathrm{ad}_f^{n-2} g}L_f\lambda(x) & * \\
\vdots & & & \vdots \\
L_g L_f^{n-1}\lambda(x) & * & \cdots & *
\end{pmatrix}. \tag{22}
\]

We now use the relative degree n assumption and the Jacobi identity to show that
the matrix on the right in (22) is lower right triangular with nonzero entries on the
skew-diagonal; Proposition 2 then follows.

By relative degree n, the first column has the required form. Proceed by
induction on the columns, using the Jacobi identity. Now the k,l entry in the
matrix is $(dL_f^k\lambda)\cdot(\mathrm{ad}_f^l g)$, for $0 \le k, l \le n-1$. The diagonal entries in question
are those for which $k + l = n - 1$. Assume the desired property for column l:
thus, assume that $L_{\mathrm{ad}_f^l g}L_f^k\lambda = 0$ for $k + l \le n - 2$, and $L_{\mathrm{ad}_f^l g}L_f^{n-1-l}\lambda \neq 0$. For
column $l + 1$ we need $L_{\mathrm{ad}_f^{l+1} g}L_f^k\lambda = 0$ for $k \le n - 3 - l$, and $L_{\mathrm{ad}_f^{l+1} g}L_f^{n-2-l}\lambda \neq 0$.
Using the Jacobi identity, column $l + 1$ is, for $0 \le k \le n - 1$:

\[
(k, l+1)\ \text{entry} = (dL_f^k\lambda)\cdot(\mathrm{ad}_f^{l+1} g)
 = L_f L_{\mathrm{ad}_f^l g}L_f^k\lambda - L_{\mathrm{ad}_f^l g}L_f^{k+1}\lambda. \tag{23}
\]

Apply the hypothesis to (23) for $k = 0,\ldots,n-3-l$ to get zero. For $k = n-2-l$,
only the last term in (23) contributes to the skew-diagonal entry,
which is

\[
-L_{\mathrm{ad}_f^l g}L_f^{n-1-l}\lambda \neq 0.
\]

Notice that this is the negative of the skew-diagonal entry in column l. Since the
first column has last entry $L_g L_f^{n-1}\lambda(x)$, the diagonal entry in column l must
therefore be $(-1)^l L_g L_f^{n-1}\lambda(x) \neq 0$ for $l = 0,\ldots,n-1$. Thus, our matrix is lower
right triangular with nonzero skew-diagonal entries for $x \in U$. ∎


Notice that Proposition 2 completes the proof of Theorem 2, because it shows
that if $\lambda$ has relative degree n then the vector function

\[
[\lambda(x),\ L_f\lambda(x),\ \ldots,\ L_f^{n-1}\lambda(x)]^T
\]

has a nonsingular Jacobian for $x \in U$.
As in (5), a geometric interpretation of Proposition 2 is that the null space of
the differential $d\lambda(x)$ is the (n − 1)-dimensional space

\[
\mathcal{D}(x) = \mathrm{span}\{g(x), \mathrm{ad}_f g(x), \ldots, \mathrm{ad}_f^{n-2} g(x)\}, \quad x \in U. \tag{24}
\]

We view each subspace $\mathcal{D}(x)$ as a subspace of the tangent space of $R^n$ at x,
$T_x R^n \cong R^n$. The collection of the subspaces (24) for $x \in U$ is called a distribution
on U. In the linear case, a constant distribution such as $\mathrm{span}\{b, Ab, \ldots, A^{n-2}b\}$ is
automatically the null space of a nonzero linear functional. Proposition 2 shows
that the nonsingularity of both factors on the left in (22) is necessary for the
solution of the ISLP. Under the assumption of Proposition 2, there is an additional
necessary condition on the distribution (24), which is not revealed in the calculations
of Proposition 2. The additional condition on $\mathcal{D}(x)$ is the geometric condition
of involutivity. Involutivity is an integrability condition that guarantees that the
distribution (24) is the space annihilated by the differential of a function having
relative degree n. The next section examines this concept, and develops the
geometric conditions for the solvability of the ISLP.


4. GEOMETRIC CRITERIA FOR ISLP SOLVABILITY. It’s convenient to place 
some formal definitions here. 


Definition 3. A distribution $\mathcal{D}$ on U is a smooth assignment (via functions like
$g$, $\mathrm{ad}_f g$, etc. in (24)) of a subspace of the tangent space $T_x U \cong R^n$, for $x \in U$.
A distribution $\mathcal{D}(x)$ is involutive in U if, for vector fields $v_1$ and $v_2$, and all $x \in U$,

\[
v_1(x), v_2(x) \in \mathcal{D}(x) \ \Longrightarrow\ [v_1, v_2](x) \in \mathcal{D}(x).
\]
A distribution $\mathcal{D}$ is nonsingular in U if $\dim \mathcal{D}(x)$ is constant in U. A nonsingular
distribution $\mathcal{D}$ with $\dim \mathcal{D}(x) = k$ is integrable in U if there are $n - k$ functions
$\lambda_j$ such that $\mathrm{span}\{d\lambda_j(x) : 1 \le j \le n - k\} = \mathcal{D}^\perp(x)$, or equivalently,
$\bigcap_{j=1}^{n-k} \ker d\lambda_j(x) = \mathcal{D}(x)$.

To illustrate the involutivity concept, we return to Example 1.


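Example 7. The distribution (24) for Example 1 is $\mathcal{D}(x) = \mathrm{span}\{g(x), \mathrm{ad}_f g(x)\}$.
Appropriate bracket calculations for Example 1 give

\[
\mathrm{ad}_f g(x) = -\begin{pmatrix} 0 & e^{x_2} & 0 \\ 1 & 0 & 0 \\ x_1 & 0 & 0 \end{pmatrix}
\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}
= -\begin{pmatrix} 0 \\ 1 \\ x_1 \end{pmatrix},
\]

and then

\[
[g, \mathrm{ad}_f g](x) = -\begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix}
\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}
= -\begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}.
\]

If we form the matrix

\[
\bigl[\,g(x)\ \ \mathrm{ad}_f g(x)\ \ [g, \mathrm{ad}_f g](x)\,\bigr]
= \begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & -x_1 & -1 \end{pmatrix},
\]

we see that its rank is everywhere equal to 3, so that $[g, \mathrm{ad}_f g](x)$ does not lie in
$\mathcal{D}(x)$ for any x. It follows that the distribution $\mathcal{D} = \mathrm{span}\{g, \mathrm{ad}_f g\}$ is not involutive
in any open set. Compare this situation with Example 5.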
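The failure of involutivity can also be checked numerically. The following is a minimal sketch, assuming that Example 1 has drift $f(x) = [e^{x_2},\ x_1,\ \tfrac{1}{2}x_1^2]^T$ and input field $g = [1\ 0\ 0]^T$; the drift is inferred from the bracket computations in Examples 7 and 8, not quoted from Example 1 itself. The helpers `jacobian`, `matvec`, `bracket`, and `det3`, and the evaluation point, are illustrative.

```python
import math

def jacobian(F, x, h=1e-4):
    """Central-difference Jacobian of the vector field F at the point x."""
    n = len(x)
    J = [[0.0] * n for _ in range(n)]
    for j in range(n):
        xp, xm = list(x), list(x)
        xp[j] += h
        xm[j] -= h
        Fp, Fm = F(xp), F(xm)
        for i in range(n):
            J[i][j] = (Fp[i] - Fm[i]) / (2.0 * h)
    return J

def matvec(M, v):
    return [sum(mij * vj for mij, vj in zip(row, v)) for row in M]

def bracket(g1, g2):
    """Return the field [g1, g2] = (dg2/dx) g1 - (dg1/dx) g2, as in (18)."""
    def field(x):
        a = matvec(jacobian(g2, x), g1(x))
        b = matvec(jacobian(g1, x), g2(x))
        return [ai - bi for ai, bi in zip(a, b)]
    return field

def det3(c0, c1, c2):
    """Determinant of the 3x3 matrix whose columns are c0, c1, c2."""
    return (c0[0] * (c1[1] * c2[2] - c2[1] * c1[2])
            - c1[0] * (c0[1] * c2[2] - c2[1] * c0[2])
            + c2[0] * (c0[1] * c1[2] - c1[1] * c0[2]))

f = lambda x: [math.exp(x[1]), x[0], 0.5 * x[0] ** 2]   # assumed drift of Example 1
g = lambda x: [1.0, 0.0, 0.0]

ad_f_g = bracket(f, g)
g_adfg = bracket(g, ad_f_g)

x = [0.7, -0.4, 2.0]                                    # arbitrary test point
print(det3(g(x), ad_f_g(x), g_adfg(x)))
```

A nonzero determinant at a point means that $[g, \mathrm{ad}_f g]$ is not in the span of g and $\mathrm{ad}_f g$ there, so the two-dimensional distribution is not closed under the bracket.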


Frobenius' Theorem states that for a nonsingular distribution $\mathcal{D}$ (of any dimension),
integrability is equivalent to involutivity [4, p. 23]. The following discussion of the
ideas of Frobenius' Theorem in the case of interest for the Input-State Linearization
Problem uses virtually all the tools discussed so far.

It is easy to show that involutivity is necessary for $\mathcal{D}(x)$ in (24) to be the tangent
space at x of the level set within U of a smooth function $\lambda$ having relative
degree n. If $\lambda$ has relative degree n at $x_0$, Proposition 2 shows that
$[g(x)\ \mathrm{ad}_f g(x)\ \ldots\ \mathrm{ad}_f^{n-1} g(x)]$ has rank n for all $x \in U$, so the distribution $\mathcal{D}$ is
nonsingular with constant dimension $n - 1$ in U. Moreover, we have
$d\lambda(x)\,[g(x)\ \mathrm{ad}_f g(x)\ \ldots\ \mathrm{ad}_f^{n-2} g(x)] = 0$. Thus, the pointwise orthogonal complement
of $\mathcal{D}$ in U is $\mathcal{D}^\perp = \mathrm{span}\{d\lambda\}$. This says that the distribution $\mathcal{D}$ is integrable
on U. We now show involutivity directly. If $0 \le i, j \le n - 2$, then for $x \in U$,

\[
d\lambda(x)\cdot[\mathrm{ad}_f^i g, \mathrm{ad}_f^j g](x)
 = L_{\mathrm{ad}_f^i g}L_{\mathrm{ad}_f^j g}\lambda(x) - L_{\mathrm{ad}_f^j g}L_{\mathrm{ad}_f^i g}\lambda(x) = 0,
\]

since $\lambda$ has relative degree n. Therefore $[\mathrm{ad}_f^i g, \mathrm{ad}_f^j g](x) \in \mathcal{D}(x)$ for all x in U.
This is sufficient to show that $\mathcal{D}$ is involutive, since any two vector fields in $\mathcal{D}$ are
linear combinations of the ones we just dealt with, and one can show that for fields
$\xi$, $\eta$ in $\mathcal{D}$ and functions a, b, we have

\[
[a\xi, b\eta](x) = a(x)b(x)[\xi, \eta](x) + \bigl(L_\xi b(x)\bigr)a(x)\,\eta(x) - \bigl(L_\eta a(x)\bigr)b(x)\,\xi(x). \tag{25}
\]

To establish (25), use the Jacobi identity plus the characterization that vector fields
v, w are equal if and only if $L_v\lambda = L_w\lambda$ for all smooth functions $\lambda$.

Theorem 3 provides the goal of computable geometric criteria on f and g for
the solution of the Input-State Linearization Problem. Involutivity is the necessary
and sufficient integrability condition for the system of partial differential equations
(16a).

Theorem 3. The system $x' = f(x) + g(x)u$ is input-state linearizable in a neighborhood
U of $x_0$ if and only if

(I) $[g(x_0)\ \mathrm{ad}_f g(x_0)\ \ldots\ \mathrm{ad}_f^{n-1} g(x_0)]$ has rank n, and

(II) the distribution $\mathcal{D} = \mathrm{span}\{g, \mathrm{ad}_f g, \ldots, \mathrm{ad}_f^{n-2} g\}$ is involutive in U.


Proof: We show that (I) and (II) are equivalent to the existence of a function $\lambda$
having relative degree n at $x_0$. Necessity of (I) is covered by Proposition 2, and
necessity of (II) was discussed before the theorem statement.


Sufficiency. If (I) and (II) hold, then the distribution $\mathcal{D}$ is nonsingular on U with
dimension $n - 1$. Clearly, there is a smooth covector field in U defined by a
smooth row vector $w(x)$ such that $\mathrm{span}\{w(x)\} = \mathcal{D}^\perp(x)$ and also $w(x)\cdot\mathrm{ad}_f^{n-1} g(x) \neq 0$
for $x \in U$. Thus, for all $x \in U$, we have

\[
\mathrm{span}\{g(x), \mathrm{ad}_f g(x), \ldots, \mathrm{ad}_f^{n-2} g(x), w(x)\} = R^n.
\]

The function $\lambda$ can be constructed from the flows of these vector fields. To
simplify notation for this argument, define $v_1 = g$, $v_2 = \mathrm{ad}_f g$, ..., $v_{n-1} = \mathrm{ad}_f^{n-2} g$,
$v_n = w$.

Let $U_\epsilon$ be a ball of radius $\epsilon$ about the zero vector in $R^n$. There is an $\epsilon > 0$ such
that the map $\psi : U_\epsilon \to \psi(U_\epsilon) \subset U$ defined by

\[
\psi(z_1, z_2, \ldots, z_n) = \phi_{z_1}^{v_1} \circ \phi_{z_2}^{v_2} \circ \cdots \circ \phi_{z_n}^{v_n}(x_0)
\]

is a diffeomorphism onto its image, that is, $\psi$ is smooth, one-to-one, and has a
smooth inverse map defined on $\psi(U_\epsilon)$. This is because repeated application of the
chain rule shows that at $z = 0$ we have

\[
\frac{\partial\psi}{\partial z_i}(0) = v_i(x_0), \quad i = 1,\ldots,n, \tag{26}
\]

and by hypothesis, the vectors $v_i(x_0)$ are independent. Thus, the Inverse Function
Theorem ensures that there is an $\epsilon > 0$ so that $\psi$ is a local diffeomorphism onto
its image. The z coordinates are time coordinates that “straighten out” the flows
for the vector fields $v_i$. Write the inverse of $\psi$ in the form

\[
\psi^{-1}(x) = \begin{pmatrix} \lambda_1(x) \\ \vdots \\ \lambda_n(x) \end{pmatrix}.
\]
Now consider (26) together with the identity

\[
\frac{\partial\psi^{-1}}{\partial x}(x)\,\Bigl[\frac{\partial\psi}{\partial z}\Bigr]_{z = \psi^{-1}(x)} = I. \tag{27}
\]

The strategy is to show that $d\lambda_n(x)$ spans $\mathcal{D}^\perp(x)$, by showing that the first $n - 1$
columns of $(\partial\psi/\partial z)_{z = \psi^{-1}(x)}$ form a basis of $\mathcal{D}(x)$ at any $x \in U$, for then (27) implies
that $\mathrm{span}\{d\lambda_n(x)\} = \mathcal{D}^\perp(x)$ for $x \in U$.

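Using the chain rule, we find that the i-th column of $\partial\psi/\partial z$ is

\[
\begin{aligned}
\frac{\partial\psi}{\partial z_i}
 &= (\phi_{z_1}^{v_1})_* \cdots (\phi_{z_{i-1}}^{v_{i-1}})_*\,\frac{\partial}{\partial z_i}\bigl(\phi_{z_i}^{v_i} \circ \cdots \circ \phi_{z_n}^{v_n}(x_0)\bigr) \\
 &= (\phi_{z_1}^{v_1})_* \cdots (\phi_{z_{i-1}}^{v_{i-1}})_*\,v_i\bigl(\phi_{z_i}^{v_i} \circ \cdots \circ \phi_{z_n}^{v_n}(x_0)\bigr) \\
 &= (\phi_{z_1}^{v_1})_* \cdots (\phi_{z_{i-1}}^{v_{i-1}})_*\,v_i\bigl(\phi_{-z_{i-1}}^{v_{i-1}} \circ \cdots \circ \phi_{-z_1}^{v_1}(\psi(z))\bigr),
\end{aligned}
\]

where we use the fact that $(\phi_z^{v})^{-1} = \phi_{-z}^{v}$ for the local flows of the $v_i$. If we show
that, whenever $\xi$, $\eta$ are vector fields that are pointwise in $\mathcal{D}$ we also have

\[
(\phi_t^\eta)_*\,\xi\bigl(\phi_{-t}^\eta(x)\bigr) \in \mathcal{D}(x),
\]

then it follows that the column vector $(\partial\psi/\partial z_i)_{z = \psi^{-1}(x)}$ is also in $\mathcal{D}(x)$. Since any
vector field $\xi$ in $\mathcal{D}$ can be written as $\sum_i c_i v_i$ for some functions $c_i$, we need
only consider the case when $\xi = v_i$ for some $i = 1,\ldots,n-1$.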

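Thus, for a vector field $\eta \in \mathcal{D}$, let x be a fixed point in U, and set

\[
V_i(t) = (\phi_{-t}^\eta)_*\,v_i\bigl(\phi_t^\eta(x)\bigr), \quad i = 1,\ldots,n-1. \tag{28}
\]

The vector functions $V_i$ are defined for some interval of t about 0. It follows from
Proposition 1 that

\[
\frac{d}{dt}V_i(t) = (\phi_{-t}^\eta)_*\,[\eta, v_i]\bigl(\phi_t^\eta(x)\bigr).
\]

Since $\mathcal{D}$ is involutive and $\eta, v_i \in \mathcal{D}$, there exist functions $a_{ij}$ defined around x
such that

\[
[\eta, v_i] = \sum_{j=1}^{n-1} a_{ij}\,v_j,
\]

so that

\[
\frac{d}{dt}V_i(t) = (\phi_{-t}^\eta)_* \sum_{j=1}^{n-1} a_{ij}\bigl(\phi_t^\eta(x)\bigr)\,v_j\bigl(\phi_t^\eta(x)\bigr)
 = \sum_{j=1}^{n-1} a_{ij}\bigl(\phi_t^\eta(x)\bigr)\,V_j(t).
\]

Thus, $V = [V_1 \ldots V_{n-1}]$ is a matrix solution of a linear system of differential
equations having the form $V' = VA^T$, where $A = [a_{ij}(\phi_t^\eta(x))]$ and $1 \le i, j \le n-1$.
Therefore we can write

\[
[V_1(t) \ldots V_{n-1}(t)] = [V_1(0) \ldots V_{n-1}(0)]\,X(t), \tag{29}
\]
where $X(t)$ is an $(n - 1) \times (n - 1)$ fundamental matrix of solutions. Multiply (29)
from the left by $(\phi_t^\eta)_*$ and use (28) to get

\[
\bigl[v_1(\phi_t^\eta(x)) \ldots v_{n-1}(\phi_t^\eta(x))\bigr]
 = \bigl[(\phi_t^\eta)_* v_1(x) \ldots (\phi_t^\eta)_* v_{n-1}(x)\bigr]\,X(t).
\]
Now, for small t we may replace x by $\phi_{-t}^\eta(x)$ on the same orbit to get

\[
\bigl[v_1(x) \ldots v_{n-1}(x)\bigr]
 = \bigl[(\phi_t^\eta)_* v_1(\phi_{-t}^\eta(x)) \ldots (\phi_t^\eta)_* v_{n-1}(\phi_{-t}^\eta(x))\bigr]\,X(t).
\]

Since $X(t)$ is nonsingular, we get for $i = 1,\ldots,n-1$,
\[
(\phi_t^\eta)_*\,v_i\bigl(\phi_{-t}^\eta(x)\bigr) \in \mathrm{span}\{v_1(x), \ldots, v_{n-1}(x)\}
 = \mathrm{span}\{g(x), \ldots, \mathrm{ad}_f^{n-2} g(x)\} = \mathcal{D}(x).
\]

The proof that the columns $\partial\psi/\partial z_i$ are in $\mathcal{D}$ for $i = 1,\ldots,n-1$ is now
complete.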

It remains to show that $\lambda_n$ satisfies the nontriviality condition required for
relative degree n. But from the identity (27) and the conditions (26) we have

\[
d\lambda_n(x_0)\cdot\frac{\partial\psi}{\partial z_n}(0) = d\lambda_n(x_0)\cdot v_n(x_0) = d\lambda_n(x_0)\cdot w(x_0) = 1.
\]

Thus, $d\lambda_n(x_0)$ is parallel to $w(x_0) \in \mathcal{D}^\perp(x_0)$, so $d\lambda_n(x_0)\cdot\mathrm{ad}_f^{n-1} g(x_0) \neq 0$. Now
apply the Jacobi identity to

\[
L_{\mathrm{ad}_f^{n-1} g}\lambda_n(x_0) = L_{[f,\ \mathrm{ad}_f^{n-2} g]}\lambda_n(x_0),
\]
to conclude that $-L_{\mathrm{ad}_f^{n-2} g}L_f\lambda_n(x_0) = (-1)^{n-1}L_g L_f^{n-1}\lambda_n(x_0) \neq 0$. Therefore $\lambda_n$ has
relative degree n in some open set in U. ∎


The proof of sufficiency in Theorem 3 follows the proof in [4, pp. 24-28] that
involutivity implies integrability. We really needed a particular codimension one
case of Frobenius' Theorem, where $\dim \mathcal{D}^\perp = 1$ and $\mathcal{D}$ is the special distribution
in (24); however, the argument for the general case is much the same.

Returning to Example 3, conditions (I) and (II) of Theorem 3 are easily
checked: (I) holds near $x_0 = 0$, and (II) holds trivially since $\mathcal{D}(x) = \mathrm{span}\{g(x)\} =
\mathrm{span}\{[0\ 1]^T\}$ is one-dimensional. In fact, in dimension $n = 2$, conditions (I) and (II)
reduce to the single condition that

\[
\mathrm{rank}\,\bigl[g(x_0)\ \ [f, g](x_0)\bigr] = 2,
\]

because then $g(x) \neq 0$ in some open set about $x_0$.


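Example 8. The result of Example 7 showed that condition (II) of Theorem 3 does
not hold for the system of Example 1. Let us check that condition (I) does hold: we
just need to compute

\[
\mathrm{ad}_f^2 g(x) = [f, \mathrm{ad}_f g](x)
 = -\begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix}
   \begin{pmatrix} e^{x_2} \\ x_1 \\ \tfrac{1}{2}x_1^2 \end{pmatrix}
 + \begin{pmatrix} 0 & e^{x_2} & 0 \\ 1 & 0 & 0 \\ x_1 & 0 & 0 \end{pmatrix}
   \begin{pmatrix} 0 \\ 1 \\ x_1 \end{pmatrix}
 = \begin{pmatrix} e^{x_2} \\ 0 \\ -e^{x_2} \end{pmatrix};
\]

then, using the calculations from Example 7, we have

\[
\bigl[g(x)\ \ \mathrm{ad}_f g(x)\ \ \mathrm{ad}_f^2 g(x)\bigr]
 = \begin{pmatrix} 1 & 0 & e^{x_2} \\ 0 & -1 & 0 \\ 0 & -x_1 & -e^{x_2} \end{pmatrix},
\]

so the rank condition (I) is satisfied at every point.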
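The rank condition can be confirmed with hand-coded Jacobians. The following is a minimal sketch under the same assumption about Example 1, namely drift $f(x) = [e^{x_2},\ x_1,\ \tfrac{1}{2}x_1^2]^T$ and $g = [1\ 0\ 0]^T$, which is inferred from the bracket computations rather than quoted. The function names and the test point are illustrative.

```python
import math

def ad2_f_g(x):
    """ad_f^2 g = (d(ad_f g)/dx) f - (df/dx) ad_f g for the assumed Example 1."""
    x1, x2, x3 = x
    f = [math.exp(x2), x1, 0.5 * x1 ** 2]           # assumed drift
    Jf = [[0.0, math.exp(x2), 0.0],
          [1.0, 0.0, 0.0],
          [x1, 0.0, 0.0]]                           # Jacobian of f
    adfg = [0.0, -1.0, -x1]                         # ad_f g = -(df/dx) g
    Jadfg = [[0.0, 0.0, 0.0],
             [0.0, 0.0, 0.0],
             [-1.0, 0.0, 0.0]]                      # Jacobian of ad_f g
    term1 = [sum(Jadfg[i][j] * f[j] for j in range(3)) for i in range(3)]
    term2 = [sum(Jf[i][j] * adfg[j] for j in range(3)) for i in range(3)]
    return [term1[i] - term2[i] for i in range(3)]

def det3(c0, c1, c2):
    """Determinant of the 3x3 matrix whose columns are c0, c1, c2."""
    return (c0[0] * (c1[1] * c2[2] - c2[1] * c1[2])
            - c1[0] * (c0[1] * c2[2] - c2[1] * c0[2])
            + c2[0] * (c0[1] * c1[2] - c1[1] * c0[2]))

def rank_determinant(x):
    """Determinant of [g, ad_f g, ad_f^2 g] at x; nonzero means rank 3."""
    g = [1.0, 0.0, 0.0]
    adfg = [0.0, -1.0, -x[0]]
    return det3(g, adfg, ad2_f_g(x))

print(rank_determinant([0.7, -0.4, 2.0]))
```

Under these assumptions the determinant equals $e^{x_2}$, which never vanishes, so condition (I) holds at every point even though condition (II) fails.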


We can now discuss condition (I) of Theorem 3 as a type of controllability 
condition known as a local reachability condition. 

Suppose $f(x_0) = 0$, so that $x_0$ is an equilibrium for $x' = f(x)$. The rank
condition (I) is exactly the condition that the local linearization defined by

\[
x' = Ax + bu = \frac{\partial f}{\partial x}(x_0)\,x + g(x_0)\,u \tag{30}
\]

is completely controllable, that is, $\mathrm{rank}\,[b\ Ab\ \ldots\ A^{n-1}b] = n$. To see this, notice
that if we write

\[
f(x) = Ax + f_2(x), \qquad g(x) = b + g_1(x),
\]

where $(\partial f_2/\partial x)(x_0) = 0$ and $g_1(x_0) = 0$, then an induction proof shows that for
each k,

\[
\mathrm{ad}_f^k g(x) = (-1)^k A^k b + p_k(x), \qquad p_k(x_0) = 0. \tag{31}
\]

Indeed, (31) holds when $k = 0$ because $g(x) = b + g_1(x)$ with $g_1(x_0) = 0$. The
induction step follows by an appropriate bracket calculation using the expansions
for f and g. Thus, (I) is exactly the complete controllability condition for (30),
provided $f(x_0) = 0$.

Let us say that a system $x' = f(x) + g(x)u$ is locally reachable at $x_0$ if there is
an open set U about $x_0$ such that every point $x \in U$ can be reached from $x_0$ in
finite time by means of a control $u(t)$. Under the conditions of Theorem 3, the
nonlinear system is locally reachable at $x_0$ because the system is locally equivalent
to the completely controllable linear system (12), via the coordinate mapping
$z = T(x)$ for $x \in U$ (and $z \in T(U)$), and the feedback transformation $u = \alpha(x) +
\beta(x)v = \alpha(T^{-1}z) + \beta(T^{-1}z)v$. For if a control $v(t)$ transfers $z_0 = 0$ to the point
$z_1$ in time $t_1$ while the trajectory $z(t)$ remains in $T(U)$ (and this can be done for
system (12)), then the corresponding $u(t)$ keeps $x(t) = T^{-1}(z(t))$ within U. Therefore,
transfers from $x_0$ to any $x_1$ in U can be accomplished in finite time. If only
the rank condition (I) holds, however, then provided $f(x_0) = 0$, local reachability
of the nonlinear system at $x_0$ can still be proved using the controllability rank
condition for the local linearization (30) together with the help of the Inverse
Function Theorem. For one result of this type, which implies local reachability at
$x_0$, see [8, Proposition 3.3, pp. 74-75].

The method of input-state linearization has been successful in addressing 
specific control problems in the areas of aircraft flight control and robotics [10, 
p. 207]. Here is a final example concerning the equations for a single-link robotic 
manipulator. 


Example 9. [6, p. 528] [10, p. 242] The dynamical equations for a single-link,
flexible-joint mechanism with negligible damping are

\[
I\ddot{q}_1 + MGL\sin q_1 + k(q_1 - q_2) = 0,
\]
\[
J\ddot{q}_2 - k(q_1 - q_2) = u,
\]
where $q_1$ and $q_2$ are angular positions, I and J are moments of inertia, k is a
spring constant, M is a mass, G is the gravitational constant, L is a distance, and u
is a motor torque input. By writing $x = [q_1\ \dot{q}_1\ q_2\ \dot{q}_2]^T$, the four-dimensional state
equations can be written as

\[
x' = f(x) + g(x)u =
\begin{pmatrix} x_2 \\ -a\sin x_1 - b(x_1 - x_3) \\ x_4 \\ c(x_1 - x_3) \end{pmatrix}
+ \begin{pmatrix} 0 \\ 0 \\ 0 \\ d \end{pmatrix} u, \tag{32}
\]

using these positive constants: $a = (MGL)/I$, $b = k/I$, $c = k/J$, and $d = 1/J$.
The unforced system has an equilibrium at $x = 0$. To determine if this system is
input-state linearizable near the origin, we check conditions (I), (II) of Theorem 3.
First, compute

\[
\frac{\partial f}{\partial x} =
\begin{pmatrix}
0 & 1 & 0 & 0 \\
-a\cos x_1 - b & 0 & b & 0 \\
0 & 0 & 0 & 1 \\
c & 0 & -c & 0
\end{pmatrix}.
\]

In this case, the spanning vector fields for $\mathcal{D}$ are constant, and appropriate bracket
calculations lead to the matrix needed in condition (I):

\[
\bigl[g\ \ \mathrm{ad}_f g\ \ \mathrm{ad}_f^2 g\ \ \mathrm{ad}_f^3 g\bigr] =
\begin{pmatrix}
0 & 0 & 0 & -bd \\
0 & 0 & bd & 0 \\
0 & -d & 0 & cd \\
d & 0 & -cd & 0
\end{pmatrix}.
\]

Since this matrix has rank 4, condition (I) holds everywhere. It is immediate that
the involutivity condition (II) holds everywhere, since $\mathcal{D}(x) = \mathrm{span}\{g, \mathrm{ad}_f g, \mathrm{ad}_f^2 g\}$
is spanned by constant vector fields. By Theorem 3, system (32) is input-state
linearizable, and the construction of the function $T_1$ with relative degree 4 can
now be attempted. In fact, since we know that the null space of $dT_1(x)$ must be the
flat distribution (linear subspace) defined by $\mathrm{span}\{g, \mathrm{ad}_f g, \mathrm{ad}_f^2 g\}$, we take $T_1$ to be
a linear function of $x_1$ alone: $T_1(x) = x_1$. The complete coordinate transformation
$T(x)$ is then obtained from (15a), yielding

\[
z = T(x) =
\begin{pmatrix} T_1(x) \\ T_2(x) \\ T_3(x) \\ T_4(x) \end{pmatrix}
= \begin{pmatrix} x_1 \\ x_2 \\ -a\sin x_1 - b(x_1 - x_3) \\ -a x_2\cos x_1 - b(x_2 - x_4) \end{pmatrix}.
\]

Alternatively, one can construct $T_1$ directly from (16) for this example: the details
appear in [6, pp. 528-529]. By defining the feedback transformation $u = \alpha(x) +
\beta(x)v$ according to (17), the equations for z are given by (12); equivalently,
$z_1^{(4)} = v$. A control v can now be designed that makes the link position $x_1 = z_1$
track a prespecified trajectory.
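The transformation and feedback for the flexible-joint example can be verified numerically. The following is a minimal sketch, assuming $T_1 = x_1$ and the feedback (17) with $n = 4$, for which $L_g L_f^3 T_1 = bd$. The numerical values of a, b, c, d, the test state, and the function names are illustrative assumptions.

```python
import math

a, b, c, d = 2.0, 1.5, 0.8, 0.5      # illustrative positive constants (assumed)

def f(x):
    """Drift vector field of (32)."""
    x1, x2, x3, x4 = x
    return [x2, -a * math.sin(x1) - b * (x1 - x3), x4, c * (x1 - x3)]

def T(x):
    """Coordinate change z = T(x) built from T1 = x1 by repeated Lie derivatives."""
    x1, x2, x3, x4 = x
    return [x1, x2,
            -a * math.sin(x1) - b * (x1 - x3),
            -a * x2 * math.cos(x1) - b * (x2 - x4)]

def feedback(x, v):
    """u = alpha(x) + beta(x) v from (17) with T1 = x1."""
    x1, x2, x3, x4 = x
    fx = f(x)
    # L_f^4 T1 = L_f T4, computed from the partial derivatives of T4
    lf4 = a * x2 ** 2 * math.sin(x1) + (-a * math.cos(x1) - b) * fx[1] + b * fx[3]
    lg_lf3 = b * d                    # L_g L_f^3 T1
    return (v - lf4) / lg_lf3

def closed_loop(x, v):
    """Right-hand side f(x) + g(x) u with g = [0, 0, 0, d]."""
    fx = f(x)
    return [fx[0], fx[1], fx[2], fx[3] + d * feedback(x, v)]

def z_dot(x, v, h=1e-5):
    """Derivative of T along the closed-loop field, by central differences."""
    xd = closed_loop(x, v)
    xp = [xi + h * di for xi, di in zip(x, xd)]
    xm = [xi - h * di for xi, di in zip(x, xd)]
    return [(p - m) / (2.0 * h) for p, m in zip(T(xp), T(xm))]

x = [0.3, -0.2, 0.5, 0.1]
print(T(x), z_dot(x, 0.7))
```

At any state, the computed derivative of z should be approximately $[z_2,\ z_3,\ z_4,\ v]$, which is the chain of integrators $z_1^{(4)} = v$.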


Clearly, systems that are fully input-state linearizable are very special. Nevertheless,
as shown by the analysis in [4, pp. 162–172], if a system is not fully
input-state linearizable as in Theorem 3, but an output function with relative 
degree r <n is known, then the system can still be transformed to a partially 
linear normal form, which is quite useful in many control problems. As we noted in 
Example 3, the case when r < n requires the examination of unobservable dynam- 
ics (usually called zero dynamics) [4, pp. 163-164]. However, the circle of ideas 
discussed here can be applied to a wide range of problems beyond the special 
conditions of Theorem 3. 


5. FURTHER READING. The equivalence problem discussed here was first con-
sidered in [1]. For the origin of the use of Lie brackets in the study of reachability 
problems see [3] and the references therein. See also [5, pp. 1-2] for some 
historical insight on the introduction into control theory of differential-geometric 
ideas centered around the Lie bracket. Reference [4] presents important control 
methods using extensions of the basic ideas discussed in this article, and includes a 
development of the required differential-geometric concepts. References [4] and 
[10] discuss the important issue of stability of unobservable dynamics (called zero 
dynamics) that arose in Example 3, which comes from [6]. Information on control- 
lability, observability, and numerous other issues appears in [8] and [11]. Reference 
[8, p. 59] contains a statement of the classical Frobenius’ Theorem on integrability 
of a system of linear, first-order partial differential equations. See [6] and [7] for 
some engineering emphasis combined with excellent theoretical exposition. Much 
of the differential-geometric nonlinear control theory generalizes the geometric 
approach to linear control theory presented in [13]. Many references to the 
primary mathematical control literature may be found in the references listed.

MATHEMATICS
C. O. Oakley, Haverford College 


Mathematics is one component of any plan for liberal education. 
Mother of all the sciences, it is a builder of the imagination, a 
weaver of patterns of sheer thought, an intuitive dreamer, a poet. 
The study of mathematics cannot be replaced by any other activity 
that will train and develop man’s purely logical faculties to the same 
level of rationality. Through countless dimensions, riding high the 
winds of intellectual adventure and filled with the zest of discovery, 
the mathematician tracks the heavens for harmony and eternal 
verity. There is not wholly unexpected surprise, but surprise 
nevertheless, that mathematics has direct application to the physi- 
cal world about us. For mathematics, in a wilderness of tragedy and 
change, is a creature of the mind, born to the cry of humanity in 
search of an invariant reality, immutable in substance, unalterable 
with time. Mathematics is an infinity of flexibles forcing pure 
thought into a cosmos. It is an arc of austerity cutting realms of 
reason with geodesic grandeur. Mathematics is crystallized clarity, 
precision personified, beauty distilled and rigorously sublimated. 
The life of the spirit is a life of thought; the ideal of thought is 
truth; everlasting truth is the goal of mathematics.

The Dirichlet Problem for Ellipsoids


John A. Baker 


The purpose of this paper is to present two elementary (and perhaps somewhat
novel) solutions of the Dirichlet problem for ellipsoids in $\mathbb{R}^n$. One of these is based
on an elegant result of Ernst Fischer, of Riesz-Fischer fame.

By the Dirichlet problem (for the Laplacian) we mean the following: Given a
bounded region (nonempty, open, connected set) $\Omega$ in $\mathbb{R}^n$, $n \ge 2$, and given a
continuous function $f : \partial\Omega \to \mathbb{R}$ (called the boundary data), find a continuous
function $u : \overline{\Omega} \to \mathbb{R}$ such that u is $C^2$ on $\Omega$,

\[
\sum_{k=1}^{n} \frac{\partial^2 u}{\partial x_k^2}(x) = 0 \quad \text{for } x \in \Omega, \text{ and}
\]
\[
u(x) = f(x) \quad \text{for } x \in \partial\Omega, \text{ the boundary of } \Omega.
\]

This is surely one of the most influential problems for the development of
mathematics in the last two centuries; see, for example, [4], [6], and [8]. The case in
which $\Omega$ is a disk in $\mathbb{R}^2$ is standard fare for writers of texts on the theory of
analytic functions of one complex variable. In this paper we are concerned with the
case in which $\Omega$ is an ellipsoid in $\mathbb{R}^n$, for arbitrary $n \ge 2$, and especially when f is
(the restriction to $\partial\Omega$ of) a polynomial function.

1. BACKGROUND AND NOTATION. Let $2 \le n \in \mathbb{Z}$. For $x = (x_1, \ldots, x_n)$ and
$y = (y_1, \ldots, y_n)$ in $\mathbb{R}^n$, let $x \cdot y = \sum_{k=1}^{n} x_k y_k$ and $|x| = (x \cdot x)^{1/2}$. For $1 \le k \le n$ we
write $\partial_k$ instead of $\partial/\partial x_k$ and define the Laplacian $\Delta = \sum_{k=1}^{n} \partial_k^2$. If $\Omega$ is a region
in $\mathbb{R}^n$ and $v : \Omega \to \mathbb{R}$ then we say that v is harmonic on $\Omega$ provided v is $C^2$ (twice
continuously differentiable) on $\Omega$ and v satisfies the Laplace equation

\[
\Delta v(x) = 0 \quad \text{for all } x \in \Omega.
\]


For completeness we include a well known proof [9, p. 103] of the weak form of 


The Maximum Principle. [f Q is a bounded region in R", v: Q, > R, v is continuous 
on Q,v is C? on Q, and Av(x) = 0 for all x € O, then vu attains its maximum 
on o&). 


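Proof: For $0 < \varepsilon \in \mathbb{R}$ let $v_\varepsilon(x) = v(x) + \varepsilon |x|^2$ for $x \in \overline{\Omega}$. Then $v_\varepsilon$ is continuous 
on $\overline{\Omega}$, $C^2$ on $\Omega$ and, for all $x \in \Omega$, 

$$\sum_{k=1}^{n} \partial_k^2 v_\varepsilon(x) = \sum_{k=1}^{n} \partial_k^2 v(x) + 2n\varepsilon \ge 2n\varepsilon > 0.$$

Hence, for each $x \in \Omega$ there is a $k$ ($1 \le k \le n$) such that $\partial_k^2 v_\varepsilon(x) > 0$; single 
variable calculus ensures that $v_\varepsilon$ does not have a relative maximum at $x$. It follows 
that, for each $\varepsilon > 0$, $v_\varepsilon$ attains its maximum on $\partial\Omega$. That is, for each $\varepsilon > 0$ there 
exists $x_\varepsilon \in \partial\Omega$ such that $v_\varepsilon(x) \le v_\varepsilon(x_\varepsilon)$ for all $x \in \overline{\Omega}$. Hence, for $\varepsilon > 0$ and $x \in \overline{\Omega}$, 
$v(x) \le v_\varepsilon(x) \le v(x_\varepsilon) + \varepsilon |x_\varepsilon|^2$, so that $v(x) \le \max\{v(y) : y \in \partial\Omega\} + \varepsilon R^2$, where 
$R = \max\{|x| : x \in \overline{\Omega}\}$. Since this is so for every $\varepsilon > 0$, $v(x) \le \max\{v(y) : y \in \partial\Omega\}$ 
for all $x \in \overline{\Omega}$. ∎ 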

Corollary 1. If $\Omega$ is a bounded region in $\mathbb{R}^n$, $u : \overline{\Omega} \to \mathbb{R}$, and $u$ is continuous on $\overline{\Omega}$ 
and harmonic on $\Omega$, then $u$ attains its maximum and its minimum on $\partial\Omega$. 


For proof it suffices to note that since $u$ is harmonic on $\Omega$, so is $-u$. 


Corollary 2. If $\Omega$ is a bounded region in $\mathbb{R}^n$ and $f : \partial\Omega \to \mathbb{R}$ is continuous, then 
there is at most one solution to the Dirichlet problem satisfying $u(x) = f(x)$ for all 
$x \in \partial\Omega$. 


Proof: Suppose that $u_1$ and $u_2$ were such solutions and let $v(x) = u_1(x) - u_2(x)$ 
for $x \in \overline{\Omega}$. Then $v$ is continuous on $\overline{\Omega}$, harmonic on $\Omega$, and vanishes on $\partial\Omega$. 
Hence, by Corollary 1, for every $x \in \overline{\Omega}$ we have 

$$0 = \min\{v(y) : y \in \partial\Omega\} \le v(x) \le \max\{v(y) : y \in \partial\Omega\} = 0,$$

i.e., $0 = v(x) = u_1(x) - u_2(x)$ for all $x \in \overline{\Omega}$. ∎ 

Let’s fix $n \ge 2$ in $\mathbb{Z}$, let $r_1, \ldots, r_n > 0$, let 

$$b(x) = 1 - \sum_{k=1}^{n} \frac{x_k^2}{r_k^2} \quad \text{for } x = (x_1, \ldots, x_n) \in \mathbb{R}^n,$$

and let $\mathcal{E}$ denote the ellipsoid $\{x \in \mathbb{R}^n : b(x) > 0\}$, so that $\partial\mathcal{E} = \{x \in \mathbb{R}^n : b(x) = 0\}$. 
We aim to solve the Dirichlet problem for $\mathcal{E}$ by first showing that it can be 
handled in the case of polynomial boundary data with the aid of an elementary 
consequence of some work of E. Fischer. 

Let $\mathcal{P}$ denote the real algebra of all polynomial functions from $\mathbb{R}^n$ into $\mathbb{R}$. For 
$0 \le m \in \mathbb{Z}$, $\mathcal{P}_m$ denotes the finite dimensional linear subspace of $\mathcal{P}$ consisting of 
those members of $\mathcal{P}$ having degree at most $m$. 

The following elegant result surely deserves to be better known. It has its origins 
in the 1917 paper [7] of Ernst Fischer; see the discussion of (2.8) on page 459 of 
[10]. Let’s call it 


Fischer’s Lemma. For $f \in \mathcal{P}$ define $L(f) = \Delta(fb)$. Then $L$ is a linear, 
degree-preserving bijection of $\mathcal{P}$ onto itself. 


Proof: Clearly $L$ is a linear operator on $\mathcal{P}$. Suppose that $L(f) = 0$ for some 
$f \in \mathcal{P}$. Let $u = fb$, so that $\Delta u(x) = 0$ for all $x \in \mathbb{R}^n$ and $u(x) = 0$ for all $x \in \partial\mathcal{E}$. 
By Corollary 2, $u(x) = 0$ for all $x \in \overline{\mathcal{E}}$, so $f(x) = 0$ for all $x \in \mathcal{E}$ since $b(x) > 0$ 
for all $x \in \mathcal{E}$. But $f$ is a polynomial; hence $f(x) = 0$ for all $x \in \mathbb{R}^n$. We have 
shown that $L$ is one-to-one. 

Now suppose that $0 \le m \in \mathbb{Z}$. If $f \in \mathcal{P}_m$, then $fb \in \mathcal{P}_{m+2}$ and hence 
$\Delta(fb) \in \mathcal{P}_m$. That is, $L$ maps $\mathcal{P}_m$ into itself. Since $\mathcal{P}_m$ is finite dimensional and $L$ 
is linear and one-to-one, $L$ maps $\mathcal{P}_m$ onto itself. ∎ 
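
As an illustrative check (a worked example added here, not part of the original argument), the action of $L$ on the lowest-degree monomials can be computed directly, taking $b(x) = 1 - \sum_{k=1}^{n} x_k^2/r_k^2$ as in the definition of the ellipsoid: 

```latex
\begin{align*}
L(1)   &= \Delta b = -2\sum_{k=1}^{n}\frac{1}{r_k^{2}}, \\
L(x_1) &= \Delta(x_1 b)
        = -\frac{6x_1}{r_1^{2}} - 2x_1\sum_{k=2}^{n}\frac{1}{r_k^{2}}
        = -2\Bigl(\frac{2}{r_1^{2}} + \sum_{k=1}^{n}\frac{1}{r_k^{2}}\Bigr)x_1 .
\end{align*}
```

In both cases the image is a nonzero multiple of the input, which is consistent with $L$ preserving degree and being one-to-one. 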


2. THE DIRICHLET PROBLEM FOR ELLIPSOIDS AND POLYNOMIAL 
BOUNDARY DATA. Fischer’s Lemma and the Maximum Principle yield a simple 
proof of 


Theorem 1. Suppose that $f \in \mathcal{P}_m$ for some $m \ge 0$. Then there exists a unique $u$ in 
$\mathcal{P}_m$ such that 

$$\Delta u(x) = 0 \ \text{ for all } x \in \mathbb{R}^n \quad \text{and} \quad u(x) = f(x) \ \text{ for all } x \in \partial\mathcal{E}. \tag{\#}$$

Proof: If $m \le 1$ the conclusion holds with $u = f$. Suppose that $m \ge 2$ and the 
degree of $f$ is at least 2. We look for a $u$ of the form $f + vb$ with $v \in \mathcal{P}_{m-2}$. For 
any such $u$, $u(x) = f(x)$ for all $x \in \partial\mathcal{E}$ and $\Delta u = \Delta f + \Delta(vb)$. By Fischer’s Lemma, 
there exists a unique $g$ in $\mathcal{P}_{m-2}$ such that $\Delta(gb) = -\Delta f$. Thus, if we define 
$u := f + gb$, then $u \in \mathcal{P}_m$ and (#) holds. Uniqueness follows from Corollary 2. ∎ 
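
The construction in this proof is effectively an algorithm: write the operator $L(g) = \Delta(gb)$ as a matrix on the monomials of degree at most $m - 2$, solve $L(g) = -\Delta f$, and set $u = f + gb$. The following is a minimal sketch of that idea, restricted to the plane ($n = 2$) and to exact rational arithmetic. The dictionary representation of polynomials and the names `add`, `mul`, `laplacian`, and `solve_dirichlet` are illustrative assumptions, not anything taken from the paper. 

```python
from fractions import Fraction

# A polynomial in (x, y) is a dict {(i, j): coeff}, meaning sum of coeff * x**i * y**j.

def add(p, q):
    """Return p + q, dropping zero coefficients."""
    out = dict(p)
    for mono, c in q.items():
        out[mono] = out.get(mono, 0) + c
    return {m: c for m, c in out.items() if c != 0}

def mul(p, q):
    """Return the product p * q."""
    out = {}
    for (i, j), c in p.items():
        for (k, l), d in q.items():
            out[(i + k, j + l)] = out.get((i + k, j + l), 0) + c * d
    return {m: c for m, c in out.items() if c != 0}

def laplacian(p):
    """Return the sum of the second partial derivatives of p in x and in y."""
    out = {}
    for (i, j), c in p.items():
        if i >= 2:
            out[(i - 2, j)] = out.get((i - 2, j), 0) + c * i * (i - 1)
        if j >= 2:
            out[(i, j - 2)] = out.get((i, j - 2), 0) + c * j * (j - 1)
    return {m: c for m, c in out.items() if c != 0}

def solve_dirichlet(f, r1, r2):
    """Return the harmonic polynomial u = f + g*b that agrees with f where b = 0."""
    m = max((i + j for (i, j) in f), default=0)
    if m < 2:
        return dict(f)  # polynomials of degree <= 1 are already harmonic
    b = {(0, 0): Fraction(1), (2, 0): -Fraction(1) / r1**2, (0, 2): -Fraction(1) / r2**2}
    # Monomial basis of the polynomials of degree <= m - 2.
    basis = [(i, d - i) for d in range(m - 1) for i in range(d + 1)]
    index = {mono: k for k, mono in enumerate(basis)}
    size = len(basis)
    # Augmented matrix: column k holds L(basis[k]); the last column holds -laplacian(f).
    rows = [[Fraction(0)] * (size + 1) for _ in range(size)]
    for k, mono in enumerate(basis):
        for out_mono, c in laplacian(mul({mono: Fraction(1)}, b)).items():
            rows[index[out_mono]][k] = c
    for mono, c in laplacian(f).items():
        rows[index[mono]][size] = -c
    # Gauss-Jordan elimination; a pivot always exists because L is one-to-one.
    for col in range(size):
        pivot = next(r for r in range(col, size) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        pv = rows[col][col]
        rows[col] = [x / pv for x in rows[col]]
        for r in range(size):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    g = {basis[k]: rows[k][size] for k in range(size) if rows[k][size] != 0}
    return add(f, mul(g, b))
```

For the unit disk ($r_1 = r_2 = 1$) and boundary data $f = x^2$, `solve_dirichlet({(2, 0): 1}, 1, 1)` produces the coefficients of $\tfrac12 + \tfrac12 x^2 - \tfrac12 y^2$, the harmonic polynomial that equals $x^2$ on the unit circle. 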


3. THE MEAN-VALUE PROPERTY AND THE WEIERSTRASS APPROXIMATION 
THEOREM. Let $B = \{x \in \mathbb{R}^n : |x| < 1\}$, $S = \partial B = \{x \in \mathbb{R}^n : |x| = 1\}$, and 
$B_r(a) = \{x \in \mathbb{R}^n : |x - a| < r\} = \{a + ry : y \in B\}$ for $a \in \mathbb{R}^n$ and $r > 0$. Denote 
by $\sigma$ the normalized, $(n-1)$-dimensional surface measure on $S$. The following 
result is a combination of [1, Theorems 1.2 and 1.20], which depend mainly upon 
the Divergence Theorem for $B$, a fairly self-contained exposition of which can be 
found in [3]. 


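The Mean-Value Property. Suppose that $\Omega$ is a region in $\mathbb{R}^n$, $u : \Omega \to \mathbb{R}$, and $u$ is 
continuous. Then $u$ is harmonic on $\Omega$ if and only if 

$$u(a) = \int_S u(a + rs)\, d\sigma(s) \quad \text{whenever } \overline{B_r(a)} \subset \Omega.$$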


According to [6, p. 35], this theorem can be traced back to an 1840 paper 
of Gauss. 


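Corollary. If $\Omega$ is a region in $\mathbb{R}^n$, $u_k : \Omega \to \mathbb{R}$ is harmonic for each $k \in \mathbb{N}$, and 
$\{u_k\}_{k=1}^{\infty}$ converges uniformly on $\Omega$ to $u : \Omega \to \mathbb{R}$, then $u$ is harmonic on $\Omega$. 


Sketch of Proof. Suppose $\overline{B_r(a)} \subset \Omega$. Then 

$$u_k(a) = \int_S u_k(a + rs)\, d\sigma(s) \ \text{ for } k \in \mathbb{N} \quad \text{and} \quad u(a) = \lim_{k \to \infty} u_k(a).$$

Moreover, since each $u_k$ is continuous and the convergence is uniform on $\Omega$, $u$ is 
continuous on $\Omega$ and 

$$\lim_{k \to \infty} \int_S u_k(a + rs)\, d\sigma(s) = \int_S u(a + rs)\, d\sigma(s).$$

Hence 

$$u(a) = \int_S u(a + rs)\, d\sigma(s).$$

By the Mean Value Property, $u$ is harmonic on $\Omega$. ∎ 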


It does not appear to be well known that, in [13], Weierstrass proved his famous 
approximation theorem not only in the case of a single real variable but also in 
higher dimensions. That “approximate identity” proof, in the one dimensional 
case, is the subject of Chapter 59 of the beautiful book of Körner [11]. The same 
proof extends to higher dimensions without serious difficulty; see [9, p. 209 and 
Problem 1 on p. 213]. Because of its geometric appeal, its intimate relationship 
with the heat equation, and the fact that it affords $C^\infty$ approximation, Weierstrass’s 
own proof, in the author’s opinion, has not been bettered. 


The Weierstrass Approximation Theorem. Given a rectangle $I$ in $\mathbb{R}^n$, a continuous 
function $f : I \to \mathbb{R}$, and $\varepsilon > 0$, there exists a $p$ in $\mathcal{P}$ such that $|f(x) - p(x)| < \varepsilon$ for 
all $x \in I$. 


By a rectangle in $\mathbb{R}^n$ we mean a product of $n$ closed bounded intervals. 

4. THE DIRICHLET PROBLEM FOR ELLIPSOIDS; A SOLUTION 


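Theorem 2. Given a continuous function $f : \partial\mathcal{E} \to \mathbb{R}$, there is a unique continuous 
function $u : \overline{\mathcal{E}} \to \mathbb{R}$ such that $u$ is $C^2$ on $\mathcal{E}$, $\Delta u(x) = 0$ for all $x \in \mathcal{E}$, and $u(x) = f(x)$ 
for all $x \in \partial\mathcal{E}$. 


Proof: For $x = (x_1, \ldots, x_n) \in \mathbb{R}^n$ let $v(x) = \sum_{k=1}^{n} x_k^2 / r_k^2$ and 

$$\tilde{f}(x) = \begin{cases} v(x)\, f\bigl(x / \sqrt{v(x)}\bigr) & \text{if } x \ne 0, \\ 0 & \text{if } x = 0. \end{cases}$$

Then $\tilde{f}$ is continuous on $\mathbb{R}^n$ and therefore, by Weierstrass’s Theorem, can be 
uniformly approximated on each rectangle by a polynomial. But $\tilde{f}(x) = f(x)$ for 
all $x \in \partial\mathcal{E}$, because $v(x) = 1$ there. Hence there is a sequence $\{f_k\}_{k=1}^{\infty}$ in $\mathcal{P}$ that 
converges to $f$ uniformly on $\partial\mathcal{E}$. 

According to Theorem 1, for each $k \in \mathbb{N}$ there is a unique $u_k \in \mathcal{P}$ such that 
$\Delta u_k(x) = 0$ for all $x \in \mathbb{R}^n$ and $u_k(x) = f_k(x)$ for all $x \in \partial\mathcal{E}$. By the Maximum 
Principle, for $j, k \in \mathbb{N}$ and $x \in \overline{\mathcal{E}}$, 

$$|u_j(x) - u_k(x)| \le \max\{|f_j(y) - f_k(y)| : y \in \partial\mathcal{E}\} \to 0 \quad \text{as } j, k \to \infty.$$

Thus $\{u_k\}_{k=1}^{\infty}$ converges uniformly on $\overline{\mathcal{E}}$ to a continuous function $u : \overline{\mathcal{E}} \to \mathbb{R}$. 
By the Corollary to the Mean Value Property, $u$ is harmonic on $\mathcal{E}$. Moreover, 
for $x \in \partial\mathcal{E}$, $u(x) = \lim_{k \to \infty} u_k(x) = \lim_{k \to \infty} f_k(x) = f(x)$. ∎ 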


5. A FINITE DIMENSIONAL DIRICHLET PRINCIPLE. Inspired by Gårding’s 
discussion of the Dirichlet principle [8, pp. 96-98], we present 


Another Proof of Theorem 1. Assume $m \ge 2$ and $f \in \mathcal{P}_m$. For $\psi \in \mathcal{P}_m$ and $x \in \mathbb{R}^n$ 
let $\nabla\psi(x) = (\partial_1\psi(x), \ldots, \partial_n\psi(x))$, the gradient of $\psi$ at $x$, and let 

$$D(\psi) = \int_{\mathcal{E}} |\nabla\psi(x)|^2\, dx \quad \text{(the Dirichlet integral).}$$

For $\psi, \chi \in \mathcal{P}_m$ define $B(\psi, \chi) = \int_{\mathcal{E}} \nabla\psi(x) \cdot \nabla\chi(x)\, dx$. Notice that $B$ is a symmetric 
bilinear form on $\mathcal{P}_m$ and $B(\psi, \psi) = D(\psi) \ge 0$ for all $\psi \in \mathcal{P}_m$. Moreover, for 
$\psi \in \mathcal{P}_m$, $D(\psi) = 0$ if and only if $\psi$ is constant (i.e., $\psi \in \mathcal{P}_0$). 

Let $V = \{\psi b : \psi \in \mathcal{P}_{m-2}\}$ and $A = \{f + v : v \in V\}$. Note that $V$ is a linear 
subspace of $\mathcal{P}_m$ and every member of $V$ vanishes on $\partial\mathcal{E}$. Hence, if we let 
$\langle \psi, \chi \rangle = B(\psi, \chi)$ for $\psi, \chi \in V$, then $\langle \cdot, \cdot \rangle$ is an inner product for $V$ and its 
associated norm satisfies $\|v\|^2 = B(v, v) = D(v)$ for $v \in V$. 

Observe that $A$ is an affine subspace of $\mathcal{P}_m$ and if $u \in A$ then $u(x) = f(x)$ for 
all $x \in \partial\mathcal{E}$. 

We aim to prove that $D$ has a unique minimizer on $A$, say $u$, and this $u$ is a 
solution to our problem. For any $v \in V$ we have 

$$D(f + v) = \int_{\mathcal{E}} |\nabla f(x) + \nabla v(x)|^2\, dx = D(f) + 2B(f, v) + \|v\|^2.$$

Now the map $v \mapsto B(f, v)$, for $v \in V$, is a linear functional on the finite dimensional 
vector space $V$. Hence there is a unique $g$ in $V$ such that $B(f, v) = \langle g, v \rangle$ 
for all $v \in V$. For $v \in V$ we therefore have 

$$D(f + v) = D(f) + 2\langle g, v \rangle + \|v\|^2 = D(f) + \|v + g\|^2 - \|g\|^2,$$

and this is clearly least exactly when $v = -g$. Let $u = f - g \in A$ and conclude that 
$D(u) \le D(w)$ for all $w \in A$. 
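
Suppose that $0 \ne v \in V$. If $t \in \mathbb{R}$ then $u + tv \in A$, so that $D(u) \le D(u + tv) = 
D(u) + 2tB(u, v) + t^2 \|v\|^2$, i.e., $0 \le 2tB(u, v) + t^2 \|v\|^2$ for all $t \in \mathbb{R}$. It follows 
that, for all $v \in V$, $0 = B(u, v) = \sum_{k=1}^{n} \int_{\mathcal{E}} \partial_k u(x)\, \partial_k v(x)\, dx$. But recall that every 
member of $V$ vanishes on $\partial\mathcal{E}$. Integration by parts therefore leads us to conclude 
that $0 = \sum_{k=1}^{n} \int_{\mathcal{E}} (\partial_k^2 u(x))\, v(x)\, dx = \int_{\mathcal{E}} \Delta u(x)\, v(x)\, dx$ for all $v \in V$. That is, 

$$0 = \int_{\mathcal{E}} \Delta u(x)\, \psi(x)\, b(x)\, dx \quad \text{for every } \psi \in \mathcal{P}_{m-2}. \tag{$*$}$$

Now $\Delta u \in \mathcal{P}_{m-2}$ and the map $(\psi, \chi) \mapsto \int_{\mathcal{E}} \psi(x) \chi(x) b(x)\, dx$ ($\psi, \chi \in \mathcal{P}_{m-2}$) is 
clearly an inner product for $\mathcal{P}_{m-2}$. Hence, (*) ensures that $\Delta u = 0$. Since $u \in A$, 
$u(x) = f(x)$ for $x \in \partial\mathcal{E}$, and uniqueness is guaranteed by Corollary 2 of the 
Maximum Principle. ∎ 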


6. REMARKS 


(i) Theorem 1 was proved in yet another nonstandard way in [5, Théorème 6, 
p. 60]. 

(ii) Suppose that $g \in \mathcal{P}$ and we are interested in solving the Dirichlet problem 
for Poisson’s equation: $\Delta u(x) = g(x)$, $x \in \mathcal{E}$, and $u(x) = f(x)$, $x \in \partial\mathcal{E}$, 
where $f : \partial\mathcal{E} \to \mathbb{R}$ is a given continuous function. By Fischer’s Lemma 
there exists a $v$ in $\mathcal{P}$ such that $g = \Delta(vb)$. By Theorem 2, there exists a 
continuous $w : \overline{\mathcal{E}} \to \mathbb{R}$ such that $w$ is $C^2$ on $\mathcal{E}$, $\Delta w(x) = 0$ for $x \in \mathcal{E}$, and 
$w(x) = f(x)$ for $x \in \partial\mathcal{E}$. Let $u(x) = w(x) + v(x)b(x)$ for $x \in \overline{\mathcal{E}}$. Then $u$ 
is continuous on $\overline{\mathcal{E}}$ and $C^2$ on $\mathcal{E}$, $\Delta u(x) = \Delta w(x) + \Delta(vb)(x) = g(x)$ for 
$x \in \mathcal{E}$, and $u(x) = w(x) = f(x)$ for $x \in \partial\mathcal{E}$. 

(iii) Theorems 1 and 2 apply to arbitrary ellipsoids and not just the “canonical” 
types considered so far. To see this it suffices to note that every ellipsoid 
is isometric to one of the kind we’ve considered and to check that the class 
of harmonic functions is invariant under isometries; see [1, pp. 2-3]. 

(iv) In place of the Laplacian one could substitute an operator of the form 
$\sum_{k=1}^{n} \lambda_k \partial_k^2$ with positive real $\lambda_k$. 

(v) Extensions of Fischer’s ideas and applications thereof to differential 
problems have been given by several authors; see [10], [12], and the 
references included therein. 

(vi) Additional intriguing properties of harmonic polynomials, together with 
applications thereof to boundary value problems for B, can be found in 
the charming paper of Axler and Ramey [2]. 

Said a mathematician named Haar, 
“Von Neumann can’t see very far. 
He missed a great treasure: 

They call it Haar measure. 

Poor Johnny’s just not up to par.” 


Contributed by Paul R. Chernoff, UC Berkeley, who provides the 
following background: John von Neumann had proved the existence of 
invariant measures on compact topological groups, but tried to discourage 
Alfred Haar from working on the locally compact case on the grounds 
that it seemed unlikely to be true in that generality. Fortunately, Haar 
persisted. 


834 THE DIRICHLET PROBLEM FOR ELLIPSOIDS [Monthly 106 


Curves Whose Curvature Depends on 
Distance From the Origin 


David A. Singer 


1. INTRODUCTION. The fundamental existence and uniqueness theorem for 
curves in Euclidean space $\mathbb{R}^3$ states that a curve is uniquely determined up to rigid 
motion by its curvature and torsion, given as functions of its arc-length. Furthermore, 
given continuous functions $\kappa(s)$ and $\tau(s)$, with $\kappa(s)$ positive and continuously 
differentiable, there is a differentiable curve (of class at least $C^3$) with 
curvature $\kappa$ and torsion $\tau$. 

In practice, such curves are often impossible to find explicitly, due to the 
difficulty in solving the Frenet Equations, the linear differential equations govern- 
ing the curve; but see [2] for an example where the equations can be solved. In 
general, a result of Lie and Darboux shows that solving these equations is 
equivalent to solving a certain complex Riccati equation; see [3, p. 36] for details. 
Happily, in the planar case the Frenet equations can always be integrated by 
quadratures. 

We consider a different sort of problem. Suppose the curvature of a proposed 
curve in the plane is given in terms of its position. Can the curve be determined, 
and if so, how? The general form of this problem requires one to solve a nonlinear 
differential equation: 

$$\frac{x'(t)\, y''(t) - y'(t)\, x''(t)}{\bigl(x'(t)^2 + y'(t)^2\bigr)^{3/2}} = \kappa(x(t), y(t)). \tag{1.1}$$

An interesting solved example of this problem occurs when the curvature is 
proportional to one of the coordinate functions, say $\kappa(x, y) = cy$. This is a 
remarkable property of the Euler elasticae, curves that minimize $\int \kappa(s)^2\, ds$ among 
curves of fixed length with fixed first-order boundary conditions. These are the 
“natural splines,” formed by taking a thin inextensible wire of uniform thickness 
and pinning and “welding” the two ends in fixed positions; see [4]. 

Among the Euler elasticae there is a unique closed curve, which is in the 
shape of a figure-eight. The curvature of this curve is given by the elliptic 
function $\operatorname{sn}(u, p)$, where $u$ is proportional to the arc-length parameter and 
$p = 0.9089086\ldots$, the elliptic modulus, satisfies the transcendental equation 
$2E(p) = K(p)$. Here $K$ and $E$ are the complete elliptic integrals of the first and 
second kind. The curvature vanishes at the crossing point, and the x-axis bisects 
the figure, with one loop above and the other below the axis. 


2. A FIRST ATTEMPT. The curvature of a planar curve is most simply defined 
using a coordinate system that moves with the curve. Let $T$ be the unit tangent 
vector to the curve $X(s)$, and let $N$ be the vector orthogonal to $T$ such that the 
frame $T, N$ is positively oriented. If $s$ is the arc-length parameter, then $dX/ds = T$, 
and the Frenet equations are 

$$\frac{dT}{ds} = \kappa N, \qquad \frac{dN}{ds} = -\kappa T. \tag{2.1}$$

Figure 1. Some Euler elasticae. 

Figure 2. The figure-eight Euler elastica. 

If we define the angle $\theta$ by writing the Euclidean coordinates of $T$ as 
$(\cos\theta, \sin\theta)$, then we can rewrite (2.1) as 

$$\frac{d\theta}{ds} = \kappa(s), \qquad \frac{dx}{ds} = \cos\theta, \qquad \text{and} \qquad \frac{dy}{ds} = \sin\theta. \tag{2.2}$$

Equations (2.2) show that once $\kappa$ has been determined, the curve can be found 
by three quadratures. They are also very useful in computing numerical and 
graphical solutions to the Frenet equations in the plane. 
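
A numerical solution of (2.2) can be sketched as follows. This is a minimal illustration using a classical fourth-order Runge-Kutta step, with the curvature supplied as a function of position; the function names `integrate_curve` and `radial`, the step count, and the choice of Runge-Kutta are assumptions made for the example, not the method used to draw the figures in the paper. 

```python
import math

def integrate_curve(kappa, x0, y0, theta0, length, steps):
    """Integrate dx/ds = cos(theta), dy/ds = sin(theta), dtheta/ds = kappa(x, y).

    Returns the list of (x, y) points visited, starting at (x0, y0) with
    initial tangent angle theta0, over total arc length `length`.
    """
    def rhs(state):
        x, y, theta = state
        return (math.cos(theta), math.sin(theta), kappa(x, y))

    h = length / steps
    state = (x0, y0, theta0)
    points = [(x0, y0)]
    for _ in range(steps):
        # Classical RK4 stages for the three-component system.
        k1 = rhs(state)
        k2 = rhs(tuple(s + 0.5 * h * k for s, k in zip(state, k1)))
        k3 = rhs(tuple(s + 0.5 * h * k for s, k in zip(state, k2)))
        k4 = rhs(tuple(s + h * k for s, k in zip(state, k3)))
        state = tuple(
            s + h * (a + 2 * b + 2 * c + d) / 6
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)
        )
        points.append((state[0], state[1]))
    return points

def radial(x, y):
    """Curvature equal to the distance from the origin."""
    return math.hypot(x, y)

# Starting on the unit circle, tangent to it, the solution should stay on the circle.
circle = integrate_curve(radial, 1.0, 0.0, math.pi / 2, 2 * math.pi, 2000)
```

Other initial positions and angles give the non-circular curves; plotting the returned points is left to whatever graphics tool is at hand. 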
The condition we seek to have the curve satisfy is $\|X\| = \kappa$. A direct assault on 
(2.2) after substituting $\kappa = \sqrt{x^2 + y^2}$ is unpromising. A more straightforward 
approach might be to write $X(s) = (\kappa(s)\cos\phi, \kappa(s)\sin\phi)$ in polar coordinates. 
This leads to a pair of coupled second-order equations, namely 

$$\frac{d^2\kappa}{ds^2} = \kappa \left(\frac{d\phi}{ds}\right)^2 - \kappa^2 \frac{d\phi}{ds}, \qquad 2\frac{d\kappa}{ds}\frac{d\phi}{ds} + \kappa \frac{d^2\phi}{ds^2} = \kappa \frac{d\kappa}{ds}, \tag{2.3}$$

which can be reduced by use of the first integral 

$$\left(\frac{d\kappa}{ds}\right)^2 + \kappa^2 \left(\frac{d\phi}{ds}\right)^2 = 1 \tag{2.4}$$

to a (somewhat unpleasant) second-order differential equation for $\kappa$. Once this is 
solved the curve can be determined by quadratures. 

A slightly more devious approach is to define functions $\alpha(s)$ and $\beta(s)$ by the 
formula 

$$X = \alpha T + \beta N \tag{2.5}$$


in order to parametrize the curve by coordinates that move with the curve. 
Differentiating (2.5) gives 

$$\frac{d\alpha}{ds} = \beta\kappa + 1, \qquad \frac{d\beta}{ds} = -\alpha\kappa. \tag{2.6}$$

The condition $\|X\| = \kappa$ in these coordinates is 

$$\alpha^2 + \beta^2 = \kappa^2, \tag{2.7}$$

so we may define an angle $\psi$ by $\alpha = \kappa\cos\psi$, $\beta = \kappa\sin\psi$. The equations (2.6) now 
lead to a pair of first order equations: 

$$\frac{d\kappa}{ds} = \cos\psi, \qquad \kappa \frac{d\psi}{ds} + \kappa^2 = -\sin\psi. \tag{2.8}$$

Unfortunately, it is again not clear how to solve the differential equations and 
determine $\kappa$. 


3. MOTION UNDER GRAVITATIONAL FORCE. Although it is not always possi- 
ble to find explicit solutions to a system of second-order differential equations, this 
can be done by quadratures for completely integrable Hamiltonian systems. Thus, 
we now attempt to reformulate our problem as a Hamiltonian system with 
sufficient symmetry to be integrable. 

A fundamental example of such a system arises from the problem of an orbiting 
object. Newton’s equations of motion are $F(X) = mX''$, where $X$ is the position 
of the object. Suppose the object is moving through space under the influence 
of a central force $F(X) = -f(r)X$, where $r = \|X\|$ is the distance from the 
origin and $f(r)$ is some continuous function. The motion satisfies the second order 
differential equations 

$$X'' = -\frac{1}{m} f(r) X, \tag{3.1}$$

and the motion satisfies Kepler’s Second Law (conservation of angular momentum): 
$X \times X' = C$, a constant vector, as can easily be seen by differentiating. The motion 
lies in a plane, which we assume is the $(x, y)$-plane, and if we put the curve in 
polar coordinates $X = (r\cos\phi, r\sin\phi)$, then 

$$r^2 \frac{d\phi}{dt} = \|X \times X'\| = c, \tag{3.2}$$

where $c = \|C\|$ is a constant (the angular momentum). 

Now define a function $\Phi(r)$ by 

$$\frac{d\Phi}{dr} = r f(r).$$

Then $F(X) = -\nabla V$, where $V(x, y, z) = \Phi\bigl(\sqrt{x^2 + y^2 + z^2}\bigr)$. The motion satisfies 
conservation of energy: 

$$\frac{1}{2} m \|X'\|^2 + \Phi(r) = \frac{1}{2} m \left[ \left(\frac{dr}{dt}\right)^2 + r^2 \left(\frac{d\phi}{dt}\right)^2 \right] + \Phi(r) = E_0, \tag{3.3}$$

where $E_0$ is a constant (the total energy). 

Using (3.2) and (3.3), it is possible to solve the equations of motion by 
quadrature. In the special case where $\Phi(r) = -1/r$, the force satisfies the inverse 
square law and we have the Kepler problem. The solutions in that case are 
ellipses, parabolas, and hyperbolas. 

Now let us compute the curvature of solutions. Since the parametrization is no 
longer by arc length, we need the general formula for the curvature of a curve: 

$$\kappa(t) = \frac{\|X' \times X''\|}{\|X'\|^3}. \tag{3.4}$$

Plugging in (3.1) and (3.2), this equation becomes 

$$\kappa(t) = \frac{c f(r)}{m \|X'\|^3}. \tag{3.5}$$

Now using (3.3) to compute the denominator, we arrive at the formula 

$$\kappa(t) = \frac{c \sqrt{m}\, \Phi'(r)}{2\sqrt{2}\, r\, (E_0 - \Phi(r))^{3/2}}. \tag{3.6}$$

Note that the value of $\kappa$ depends only on $r$. Define $\mu(r) = (E_0 - \Phi(r))^{-1/2}$. 
Then (3.6) reduces to 

$$\frac{d\mu}{dr} = \frac{\sqrt{2}}{c\, m^{1/2}}\, r \kappa(r). \tag{3.7}$$

If $\kappa(r)$ is any function such that $r\kappa(r)$ is continuous, we can solve (3.7) for $\mu$. 
Define $\Phi(r) = -1/\mu(r)^2$, and consider solutions to (3.1) with energy $E_0 = 0$. 
Note that a specific choice of $\mu$ also specifies a value of the momentum $c_0$. That 
is, a solution to (3.1) has curvature proportional to the given function provided it 
has energy 0; its curvature is equal to the given function if its angular momentum 
$c = c_0$. This is part of the proof of our main result: 


Theorem 3.1. Let $\kappa(r)$ be a function such that $r\kappa(r)$ is continuous. Then the problem 
of determining a curve whose curvature is $\kappa(r)$, where $r$ is the distance from the origin, 
is solvable by quadratures. 


4. THE CASE $\kappa(r) = r$. Using (2.2), it is not difficult to produce pictures of 
curves whose curvature is equal to the distance from the origin. Some of them are 
illustrated in Figure 3. We want to find analytic representations for these curves. 

Figure 3. Curves with curvature equal to radial distance. 

Theorem 3.1 is slightly deceptive. The fact that the differential equation is 
integrable by quadratures does not mean that it is easy to perform the integrations, 
as we now illustrate with the case that inspired this paper: $\kappa(r) = r$. The first step, 
solving (3.7), is easy and gives 

$$\mu(r) = \frac{\sqrt{2}}{3 c_0 \sqrt{m}} (r^3 + a), \qquad \Phi(r) = -\frac{9 c_0^2 m}{2 (r^3 + a)^2}, \tag{4.1}$$

where $a$ is a constant of integration. This corresponds to a force law in which the 
magnitude is proportional to $r^2/(r^3 + a)^3$. 

Applying (3.3) and assuming that $r^2 \phi' = c_0$, we obtain 

$$(r')^2 + r^2 (\phi')^2 = \frac{9 c_0^2}{(r^3 + a)^2}, \qquad \text{that is,} \qquad (r')^2 = \frac{c_0^2 \bigl(9 r^2 - (r^3 + a)^2\bigr)}{r^2 (r^3 + a)^2}. \tag{4.2}$$

This has among its solutions the circular orbit $X(t) = (r_0 \cos(\alpha t), r_0 \sin(\alpha t))$, 
where $\alpha = 3c_0 / \bigl(r_0 (r_0^3 + a)\bigr)$ and $r_0^2 \alpha = c_0$. This implies $3r_0 = r_0^3 + a$. However, $X$ is 
a solution to (3.1) only when $\alpha^2 = 27 c_0^2 r_0 / (r_0^3 + a)^3$. This implies $r_0 = 1$, which of 
course we knew! Other solutions can have $r' = 0$ only at isolated points. 

We can eliminate $t$ and find $\phi = \phi(r)$ from 

$$\phi = \pm \int \frac{r^3 + a}{r \sqrt{9 r^2 - (r^3 + a)^2}}\, dr. \tag{4.3}$$

This is not always an elementary integral. One special case, however, is very 
pleasant, namely the case where $a = 0$. Then the integral becomes elementary, 
and the resulting curve is given by 

$$\phi - \phi_0 = \frac{1}{2} \arcsin\frac{r^2}{3}, \qquad r^2 = 3 \sin 2(\phi - \phi_0). \tag{4.4}$$

This curve is none other than (one leaf of) the Bernoulli lemniscate shown in 
Figure 4. 
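
That the curve $r^2 = 3\sin 2(\phi - \phi_0)$ really has curvature equal to $r$ can be checked numerically. The sketch below takes $\phi_0 = 0$ and uses the standard polar-coordinate curvature formula $\kappa = (r^2 + 2r'^2 - r r'')/(r^2 + r'^2)^{3/2}$, with $r'$ and $r''$ obtained by differentiating $r^2 = 3\sin 2\phi$ implicitly; the function name `lemniscate_curvature` is a hypothetical one chosen for the example. 

```python
import math

def lemniscate_curvature(phi):
    """Return (r, kappa) on the polar curve r**2 = 3*sin(2*phi), for 0 < phi < pi/2."""
    s, c = math.sin(2 * phi), math.cos(2 * phi)
    r = math.sqrt(3 * s)
    dr = 3 * c / r                 # from 2 r r' = 6 cos(2 phi)
    d2r = (-6 * s - dr * dr) / r   # from r'^2 + r r'' = -6 sin(2 phi)
    # Curvature of a polar curve r(phi).
    kappa = (r * r + 2 * dr * dr - r * d2r) / (r * r + dr * dr) ** 1.5
    return r, kappa
```

Evaluating the function at several angles inside the leaf returns pairs whose two entries agree up to rounding, as expected from $\kappa(r) = r$. 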

Figure 4. The Bernoulli lemniscate. 


From this example it becomes clear why the previous, more straightforward, 
attempts led to such complicated equations. The solution to the original equations 
would have given the lemniscate parametrized by arc length. However, the arc 
length function for the lemniscate is an elliptic integral. Thus even this special case 
would be much more complicated to find. In fact, we have not yet solved (3.1) 
explicitly, even in the special case; we found instead a non-parametric representa- 
tion of the trajectory. This is not difficult to do, however, as we observe in the 
next section. 


5. SUB-AFFINE ARC LENGTH. Solutions to equations of motion for particles 
moving under the influence of a central force have a natural parametrization. 
From the equation $X'' = g(x, y)X$ we have observed that $\|X \times X'\|$ is constant. 
Now let $X(t)$ be a curve satisfying $X \times X' \ne 0$. Then there is a re-parametrization 
of the curve such that with respect to the new parameter $\sigma = \sigma(t)$ the 
curve satisfies 

$$\left\| X(\sigma) \times \frac{dX}{d\sigma}(\sigma) \right\| = 1. \tag{5.1}$$

This parameter is computed by 

$$\sigma(t) = \int_0^t \|X \times X'\|\, dt. \tag{5.2}$$
0 

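In the case of the lemniscate $r^2 = 3\cos(2\theta)$, parametrized by the polar angle $\theta$, 
evaluation of (5.2) is an elementary calculation: $\|X \times X'\| = r^2 = 3\cos(2\theta)$, 
which yields $\sigma = \tfrac{3}{2}\sin(2\theta)$. 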

The parameter $\sigma$ may be called the sub-affine arc length parameter. The 
motivation for this name is the following: If $Y(t)$ is a curve in the plane such that 
the vectors $Y'(t)$ and $Y''(t)$ are linearly independent, then we may re-parametrize 
the curve by a parameter $\sigma$ such that with respect to this new parametrization 
$\|Y' \times Y''\| = 1$. This parametrization is known as the affine arc length; see [1, 
p. 149]. It plays a role in affine geometry exactly analogous to the usual arc length 
parameter in Euclidean geometry. If a curve is parametrized by affine arc-length, 
then $Y''' + \rho Y' = 0$, where $\rho(\sigma)$ is the affine curvature of the curve $Y$; it is a 
geometric invariant of the curve under unimodular affine transformations of 
the plane. 

If $Y$ is a curve parametrized by affine arc length, then its derivative $X(\sigma) = 
dY/d\sigma$ is parametrized by subaffine arc length. We call the function $\rho(\sigma)$ 
determined by $d^2X/d\sigma^2 = -\rho(\sigma) X$ the subaffine curvature of $X$. This is a 
geometric invariant of $X$ under unimodular linear transformations of $\mathbb{R}^2$. 

Our problem was to find curves whose curvature is equal to the distance from 
the origin. Let $\gamma$ be such a curve. If $\gamma(t) \times \gamma'(t) \ne 0$, then we can locally 
parametrize $\gamma$ by subaffine arc length. Then 

$$\frac{d^2\gamma}{d\sigma^2} = -\rho(\sigma)\, \gamma(\sigma). \tag{5.3}$$

The Euclidean curvature is given by the formula $\kappa(\gamma(\sigma)) = \rho(\sigma) / \|\gamma'(\sigma)\|^3$. 

If we now assume that the radial distance from the origin varies monotonically 
on some interval $\sigma_1 < \sigma < \sigma_2$, we can solve for $\sigma$ in terms of $r$ and define 
$f(r) = \rho(\sigma(r))$. Thus, on any part of a curve along which $X \times X'$ does not vanish 
and $X \cdot X'$ does not vanish, the curve arises as a solution to an equation of the 
form (3.1). 

Local extrema of $r$ do not present any difficulty, since it is evident that the 
curve has a symmetry at each such point. Places where $X \times X' = 0$, however, 
represent limits of trajectories. For example, in the case of the lemniscate the 
origin is a singularity of the orbit. Note, however, that these are of necessity 
isolated points on the curve. This observation completes the proof of the theorem. 

It is interesting to note that the graphical solution corresponding to the 
lemniscate actually produces two lemniscates at right angles to each other. Indeed, 
the solution to (2.2) with initial conditions $x = 0$, $y = 0$, $\theta$ arbitrary, sweeps out 
alternate halves of the two lemniscates, producing a flower with four loops. Note 
that the equation we have solved is for the signed curvature equaling the radial 
distance. Thus $\kappa$ is required to remain non-negative. The curvature of the 
lemniscate changes sign as it passes through the origin. 

The Fifty-Ninth William Lowell Putnam 
Mathematical Competition 


Leonard F. Klosinski, Gerald L. Alexanderson, and Loren C. Larson 


The results of the Fifty-ninth William Lowell Putnam Mathematical Competition 
held December 5, 1998, follow. They have been determined in accordance with the 
regulations governing the competition, a contest supported by the William Lowell 
Putnam Prize Fund for the Promotion of Scholarship, a fund endowed by Mrs. 
Putnam in memory of her husband. The annual Competition is held under the 
auspices of the Mathematical Association of America. 

The first prize, $25,000, was awarded to the Department of Mathematics at 
Harvard University. The members of the winning team were Michael L. Develin, 

Ciprian Manolescu, and Dragos N. Oprea; each was awarded a prize of $1000. 
The second prize, $20,000, was awarded to the Department of Mathematics at 
the Massachusetts Institute of Technology. The members of the winning team were 
Amit Khetan, Eric H. Kuo, and Edward D. Lee; each was awarded a prize of $800. 

The third prize, $15,000, was awarded to the Department of Mathematics at 
Princeton University. The members of the winning team were Craig R. Helfgott, 
Michael R. Korn, and Yuliy V. Sannikov; each was awarded a prize of $600. 

The fourth prize, $10,000, was awarded to the Department of Mathematics at 
the California Institute of Technology. The members of the winning team were 
Christopher C. Chang, Christopher M. Hirata, and Hanhui Yuan; each was 
awarded a prize of $400. 

The fifth prize, $5,000, was awarded to the Department of Mathematics at the 
University of Waterloo. The members of the winning team were Sabin Cautis, 
Derek I. E. Kisman, and Soroosh Yazdani; each was awarded a prize of $200. 

The five highest ranking individual contestants, in alphabetical order, were 
Nathan G. Curtis, Duke University; Michael L. Develin, Harvard University; Kevin 
D. Lacker, Duke University; Ciprian Manolescu, Harvard University; and Ari M. 
Turner, Princeton University. Each of these has been designated a Putnam Fellow 
by the Mathematical Association of America and awarded a prize of $2,500 by the 
Putnam Prize Fund. 

The next ten highest ranking contestants, in alphabetical order, were: Sabin 
Cautis, University of Waterloo; Christopher C. Chang, California Institute of 
Technology; Donny C. Cheung, University of Waterloo; Dragos F. Ghioca, Univer- 
sity of Pittsburgh; Andrei C. Gnepp, Harvard University; Michael Ostrovsky, 
Stanford University; Colin A. Percival, Simon Fraser University; Alexander H. 
Saltman, Harvard University; Daniel A. Stronger, Harvard University; and Liang 
Yang, Yale University; each was awarded a prize of $1,000. 

The next eleven highest ranking contestants, in alphabetical order, were: Ian W. 
Caines, Dalhousie University; Scott H. Carnahan, California Institute of Technol- 
ogy; Adrian D. Corduneanu, University of Toronto; Matthew T. Gealy, University 
of Chicago; Larry D. Guth, Yale University; Amit Khetan, Massachusetts Institute 
of Technology; Michael R. Korn, Princeton University; Abhinav Kumar, 
Massachusetts Institute of Technology; Davesh Maulik, Harvard University; 
Dragos Nicolae Oprea, Harvard University; and Ronfeng Sun, Clark University; 
each was awarded a prize of $250. 

The following teams, named in alphabetical order, received honorable mention: 
University of Chicago, with team members Benjamin M. Cowan, Matthew T. 
Gealy, and Christopher D. Malon; Duke University, with team members Nathan 
G. Curtis, Andrew O. Dittmer, and Carl A. Miller; The Johns Hopkins University, 
with team members Alexander J. Diesl, Rakesh M. Lal, and Nehal S. Munshi; 
Stanford University, with team members Eugene G. Davydov, Alexander S. Dugas, 
and Michael Ostrovsky; and University of Toronto, with team members Cyrus 
Chen Hsia, Bhaskara M. Marthi, and Ryan O'Donnell. 

Honorable mention was achieved by the following thirty-three individuals 
named in alphabetical order: Chetan Tukaram Balwe, University of Michigan, 
Ann Arbor; Adrian Birka, Massachusetts Institute of Technology; Dmitriy S. 
Boyarchenko, University of Pennsylvania; Li-Chung Chen, Harvard University; 
Constantin S. Chiscanu, Massachusetts Institute of Technology; John J. Clyde, 
Duke University; Shai M. Cohen, University of Toronto; Benjamin M. Cowan, 
University of Chicago; Samit Dasgupta, Harvard University; Kenneth K. Easwaran, 
Stanford University; Frederik H. Eaton, California Institute of Technology; Brad 
A. Friedman, University of Illinois, Urbana-Champaign; Craig R. Helfgott, 
Princeton University; Christopher M. Hirata, California Institute of Technology; 
Cyrus Chen Hsia, University of Toronto; Kai Huang, Massachusetts Institute of 
Technology; Liviu Ignat, University of Pittsburgh; Miro Jurisic, Massachusetts 
Institute of Technology; Scott P. Kempen, Marquette University; Derik I. E. 
Kisman, University of Waterloo; Eric H. Kuo, Massachusetts Institute of Technol- 
ogy; Chin-Aik Lee, Lebanon Valley College; Edward D. Lee, Massachusetts 
Institute of Technology; Christopher C. Mihelich, Harvard University; Carl A. 
Miller, Duke University; Mintcho P. Petkov, Dartmouth College; Dmitry L. 
Sagalovskiy, Harvard University; Yuliy V. Sannikov, Princeton University; Jan K. 
Siwanowicz, City College of New York; Mark J. Tilford, California Institute of 
Technology; Ian W. T. Vander Burgh, University of Waterloo; Cristian Voicu, 
St. Lawrence University; and Hoeteck Wee, Massachusetts Institute of Technol- 
ogy. 

The other individuals who achieved ranks among the top 103, in alphabetical 
order of their schools, were: University of British Columbia, Lawrence Tang; 
California Institute of Technology, Ryan L. McCorvie, Michael A. Shulman, 
Hanhui Yuan, and Kaiwen Xu; Case Western Reserve University, Andrew D. 
Frohmader; University of Central Florida, Daniel E. Moraseski; University of 
Chicago, Charles D. Cadmon, Christopher D. Malon, and Sergey Vassiliev; Uni- 
versity of Colorado, Boulder, Tao He; Duke University, Andrew O. Dittmer; 
Georgia Institute of Technology, Jeffrey M. Fowler; University of Georgia, Charles 
Rollin Mathis; Harvard University, Lukasz Fidkowski, Robert Ribciuc, and Joshua 
S. Vonkorff; Harvey Mudd College, Ranjithkumar Rajagopalan; The Johns Hop- 
kins University, Alexander J. Diesl; Massachusetts Institute of Technology, Michael 
L. Brasher, Pokman Cheung, Shamik Das, Ashish Mishra, Brent J. Yen, and Boris 
Zbarsky; McGill University, Alexandru E. Ghitza; University of Michigan, Ann 
Arbor, Kurt A. Steinkraus; University of Nebraska, Lincoln, Travis W. Fisher; New 
York University, Ioana Dumitriu; Northeastern University, Ivo K. Nikolov; Univer- 
sity of Pennsylvania, David Futer; Princeton University, Todd W. Geldon; Rice 
University, Brian David Rothbach; Simon's Rock College, Robert J. Young; 
University of South Carolina, Jason M. Burns; Stanford University, Eugene V. 
Davydov, Alex S. Dugas, and Siutaur Pang; University of Toronto, Bhaskara M. 
Marthi, and Ryan W. O'Donnell; University of Vermont, Laura P. Riccio; 
Washington University, St. Louis, Dan B. Johnston; and University of Waterloo, Joel 
Kamnitzer and Soroosh Yazdani. 

There were 2,581 individual contestants from 419 colleges and universities in 
Canada and the United States in the Competition of December 5, 1998. Teams 
were entered by 319 institutions. The Questions Committee for the fifty-ninth 
Competition consisted of Michael J. Larsen (Chair), University of Pennsylvania; 
Steven G. Krantz, Washington University, St. Louis; and David J. Wright, 
Oklahoma State University; they composed the problems and were most promi- 
nent among those suggesting solutions. Alternate solutions to some problems have 
been published in Mathematics Magazine. 


THE 59TH ANNUAL WILLIAM LOWELL PUTNAM EXAMINATION


A1. A right circular cone has base of radius 1 and height 3. A cube is inscribed in
the cone so that one face of the cube is contained in the base of the cone. 
What is the side-length of the cube? 


A2. Let s be any arc of the unit circle lying entirely in the first quadrant. Let A be 
the area of the region lying below s and above the x-axis and let B be the area 
of the region lying to the right of the y-axis and to the left of s. Prove that 
A +B depends only on the arc length, and not on the position, of s. 


A3. Let f be a real function on the real line with continuous third derivative.
Prove that there exists a point a such that $f(a) \cdot f'(a) \cdot f''(a) \cdot f'''(a) \ge 0$.


A4. Let $A_1 = 0$ and $A_2 = 1$. For $n > 2$, the number $A_n$ is defined by concatenating
the decimal expansions of $A_{n-1}$ and $A_{n-2}$ from left to right. For example,
$A_3 = A_2A_1 = 10$, $A_4 = A_3A_2 = 101$, $A_5 = A_4A_3 = 10110$, and so forth.
Determine all n such that 11 divides $A_n$.


A5. Let $\mathcal{F}$ be a finite collection of open discs in $\mathbb{R}^2$ whose union contains a set
$E \subseteq \mathbb{R}^2$. Show that there is a pairwise disjoint subcollection $D_1, \ldots, D_n$ in $\mathcal{F}$
such that

$$\bigcup_{j=1}^{n} 3D_j \supseteq E.$$

Here, if D is the disc of radius r and center P, then 3D is the disc of radius 3r
and center P.


A6. Let A, B, C denote distinct points with integer coordinates in $\mathbb{R}^2$. Prove that if

$$(|AB| + |BC|)^2 < 8 \cdot [ABC] + 1$$

then A, B, C are three vertices of a square. Here |XY| is the length of
segment XY and [ABC] is the area of triangle ABC.


B1. Find the minimum value of

$$\frac{(x + 1/x)^6 - (x^6 + 1/x^6) - 2}{(x + 1/x)^3 + (x^3 + 1/x^3)}$$

for $x > 0$.


B2. Given a point (a, b) with 0 < b <a, determine the minimum perimeter of a 
triangle with one vertex at (a, b), one on the x-axis, and one on the line y = x. 
You may assume that a triangle of minimum perimeter exists. 


B3. Let H be the unit hemisphere $\{(x, y, z) : x^2 + y^2 + z^2 = 1,\ z \ge 0\}$, C the unit
circle $\{(x, y, 0) : x^2 + y^2 = 1\}$, and P the regular pentagon inscribed in C.
Determine the surface area of that portion of H lying over the planar region
inside P, and write your answer in the form $A \sin\alpha + B \cos\beta$, where $A$, $B$,
$\alpha$, and $\beta$ are real numbers.


B4. Find necessary and sufficient conditions on positive integers m and n so that

$$\sum_{i=0}^{mn-1} (-1)^{\lfloor i/m \rfloor + \lfloor i/n \rfloor} = 0.$$


B5. Let N be the positive integer with 1998 decimal digits, all of them 1; that is,

$$N = \underbrace{1111 \cdots 11}_{1998\ \text{digits}}.$$

Find the thousandth digit after the decimal point of $\sqrt{N}$.


B6. Prove that, for any integers a, b, c, there exists a positive integer n such that
$\sqrt{n^3 + an^2 + bn + c}$ is not an integer.


SOLUTIONS. In the 12-tuples $(n_{10}, n_9, n_8, n_7, n_6, n_5, n_4, n_3, n_2, n_1, n_0, n_{-1})$
following each problem number, $n_i$ for $10 \ge i \ge 0$ is the number of students among
the top 199 contestants achieving i points for the problem and $n_{-1}$ is the number
of those not submitting solutions.


A1. (156, 23, 4, 0, 0, 0, 0, 0, 0, 0, 16, 0)

Solution. Consider a plane cross-section through a vertex of the cube and the axis
of the cone shown in Figure 1. Let s be the side of the cube. Segment DE is a
diagonal of the top of the cube and has length $\sqrt{2}\,s$. By similarity of triangles ADE
and ABC, we have DE/BC = AF/AG, or $\sqrt{2}\,s/2 = (3 - s)/3$. Hence

$$s = \frac{6}{3\sqrt{2} + 2} = \frac{9\sqrt{2} - 6}{7}.$$

[Figure 1. Cross-section of the cone, with labeled points A, B, G, C.]
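
The value of s can be sanity-checked numerically. The sketch below is not part of the published solution; it assumes only the similar-triangles relation above and confirms that the top corners of the cube sit exactly on the cone. The variable names are illustrative.

```python
import math

# Side length obtained from sqrt(2)*s/2 = (3 - s)/3.
s = 6 / (3 * math.sqrt(2) + 2)

# At height s the cone (base radius 1, height 3) has radius (3 - s)/3,
# while a top corner of the cube lies at distance s*sqrt(2)/2 from the axis.
cone_radius_at_top = (3 - s) / 3
corner_distance = s * math.sqrt(2) / 2

print(s, (9 * math.sqrt(2) - 6) / 7)
```
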


A2. (103, 35, 26, 0, 0, 0, 0, 0, 9, 8, 12, 6) 


Solution. Suppose the beginning and ending angles of s are $\alpha$ and $\beta$, respectively,
where angles are measured counterclockwise from the positive horizontal axis.

$$A = \int_{\cos\beta}^{\cos\alpha} \sqrt{1 - x^2}\,dx = \int_{\beta}^{\alpha} \sqrt{1 - \cos^2 u}\,(-\sin u)\,du = \int_{\alpha}^{\beta} \sin^2 u\,du.$$

Similarly

$$B = \int_{\sin\alpha}^{\sin\beta} \sqrt{1 - y^2}\,dy = \int_{\alpha}^{\beta} \cos^2 u\,du.$$

Then

$$A + B = \int_{\alpha}^{\beta} (\sin^2 u + \cos^2 u)\,du = \int_{\alpha}^{\beta} du = \beta - \alpha.$$


A3. (82, 34, 2, 0, 0, 0, 0, 0, 5, 0, 39, 37) 


Solution. Assume $f(x) f'(x) f''(x) f'''(x) < 0$ for all x. By continuity, each of
$f, f', f'', f'''$ has constant sign. By replacing f(x) by f(−x), −f(x), or −f(−x), if
necessary, we may assume that $f(x) > 0$ and $f'(x) > 0$ for all x. Then $f''$ and $f'''$ have
opposite signs. In either case, choosing $g = f$ or $g = f'$ as necessary, we have a
function g such that $g(x) > 0$, $g'(x) > 0$, and $g''(x) < 0$ for all x. Since g is strictly
increasing and bounded below, there is a constant $C \ge 0$ such that $g(x) \to C^+$ as
$x \to -\infty$. This horizontal asymptote forces g to be concave upwards at some point x,
in contradiction to our assumption $g''(x) < 0$.

To be precise, fix $x_1$, and let $m = g'(x_1) > 0$. Since
$\lim_{x_2 \to -\infty} (g(x_1) - g(x_2))/(x_1 - x_2) = 0$, there is an $x_2$, $x_2 < x_1$, such that
$0 < (g(x_1) - g(x_2))/(x_1 - x_2) < m/2$. By the Mean Value Theorem, there is an $x_3$
strictly between $x_1$ and $x_2$ with $g'(x_3) = (g(x_1) - g(x_2))/(x_1 - x_2) < m/2$. Then there
is an $x_4$ between $x_1$ and $x_3$ for which

$$g''(x_4) = \frac{g'(x_1) - g'(x_3)}{x_1 - x_3} > 0.$$


A4. (39, 27, 52, 0, 0, 0, 0, 0, 49, 7, 14, 11) 


Solution. $A_n$ is divisible by 11 precisely when n is one more than a multiple of 6.

We know that a number is divisible by 11 if and only if the alternating sum of its
digits is divisible by 11. So, let $d_n$ be the alternating sum of the digits in $A_n$
starting from the left. Then $d_n = d_{n-1} + (-1)^{r_{n-1}} d_{n-2}$, where $r_n$ is the number of
digits in $A_n$. Clearly, $r_n = F_n$, the nth Fibonacci number. The Fibonacci numbers have
parity pattern $((-1)^{F_n})$: −1, −1, 1, −1, −1, 1, .... Thus, the rules for calculating $d_n$
are $d_1 = 0$, $d_2 = 1$, $d_3 = d_2 - d_1 = 1$, $d_4 = d_3 + d_2 = 2$, $d_5 = d_4 - d_3 = 1$,
$d_6 = d_5 - d_4 = -1$, $d_7 = d_6 + d_5 = 0$, $d_8 = d_7 - d_6 = 1$, and so forth in this repeating
pattern. Since $d_n$ takes only the values −1, 0, 1, 2, it is divisible by 11 exactly
when $d_n = 0$, that is, when n is one more than a multiple of 6.
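
The period-6 pattern can be confirmed by direct computation. The following sketch is an illustration rather than part of the published solution: it tracks $A_n$ modulo 11 together with the digit count, using the concatenation rule from the problem statement. The function name `divisible_by_11` is made up for this example.

```python
def divisible_by_11(limit):
    """Return the indices n <= limit for which 11 divides A_n."""
    # Track A_n mod 11 and the digit count r_n instead of the huge numbers.
    value = {1: 0, 2: 1}
    digits = {1: 1, 2: 1}
    for n in range(3, limit + 1):
        # A_n is A_{n-1} followed by A_{n-2}, so shift A_{n-1} left by r_{n-2} digits.
        value[n] = (value[n - 1] * pow(10, digits[n - 2], 11) + value[n - 2]) % 11
        digits[n] = digits[n - 1] + digits[n - 2]
    return [n for n in range(1, limit + 1) if value[n] == 0]

hits = divisible_by_11(60)
print(hits)
```
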


A5. (85, 24, 15, 0, 0, 0, 0, 0, 5, 2, 8, 60)


Solution. Let $D_1$ be a disc of greatest radius. Select $D_2$ to be a disc of greatest
radius that is disjoint from $D_1$. Proceeding inductively, suppose that $D_1, \ldots, D_{j-1}$
have been chosen. Select $D_j$ to be a disc of greatest radius that is disjoint from
$D_1, \ldots, D_{j-1}$.

The process eventually stops, producing a pairwise disjoint collection of discs,
$D_1, \ldots, D_n$. We claim that $\bigcup_{j=1}^{n} 3D_j \supseteq E$. To prove this assertion, it suffices to see
that $\bigcup_{j=1}^{n} 3D_j \supseteq \bigcup_{D \in \mathcal{F}} D$.

Let $D \in \mathcal{F}$. If D is one of the selected discs $D_j$ then the assertion is trivial. If D
is not one of the selected discs, then let $D_m$ be the first disc in the sequence that
intersects D. Then, by the way we chose the $D_j$, the radius of $D_m$ is at least as
great as the radius of D. But the triangle inequality implies that $3D_m \supseteq D$.
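
The greedy selection in this solution is easy to simulate. The sketch below is an illustration under stated assumptions, not the authors' code: discs are represented as `(x, y, r)` tuples, the sample data is random, and the helper names `greedy_disjoint` and `inside_tripled` are hypothetical. It checks the stronger statement used in the proof, namely that every disc of the collection lies inside some tripled selected disc.

```python
import math
import random

def greedy_disjoint(discs):
    """Scan discs by decreasing radius, keeping each one that is
    disjoint from every disc already chosen. Each disc is (x, y, r)."""
    chosen = []
    for d in sorted(discs, key=lambda d: d[2], reverse=True):
        # Open discs are disjoint exactly when the centers are at least r1 + r2 apart.
        if all(math.dist(d[:2], c[:2]) >= d[2] + c[2] for c in chosen):
            chosen.append(d)
    return chosen

def inside_tripled(d, c):
    """True if disc d lies inside the disc 3c (same center as c, radius 3r)."""
    return math.dist(d[:2], c[:2]) + d[2] <= 3 * c[2]

random.seed(0)
discs = [(random.uniform(0, 10), random.uniform(0, 10), random.uniform(0.2, 2))
         for _ in range(200)]
chosen = greedy_disjoint(discs)
covered = all(any(inside_tripled(d, c) for c in chosen) for d in discs)
print(len(chosen), covered)
```

Sorting once by radius and scanning is equivalent to repeatedly picking a largest disc disjoint from those already selected, up to the order in which ties are broken.
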


A6. (12, 1, 3, 0, 0, 0, 0, 0, 3, 9, 92, 79)


Solution. The set $\mathbb{Z}^2$ is preserved by 90° rotation about any of its points.
Therefore, there exists a point C′ in $\mathbb{Z}^2$ with |BC′| = |BA|, BC′ perpendicular to
BA such that C and C′ belong to the same half-plane with respect to the line
through A and B. If C ≠ C′, then $|CC'| = s \ge 1$. Let r = |AB|. By rotating and
translating the coordinate system, we may assume that B = (0, 0), A = (r, 0),
C′ = (0, r), and $C = (0, r) + s(\cos\theta, \sin\theta)$, where $\theta$ is the angle to the line C′C
measured from the positive horizontal axis. Thus

$$(|AB| + |BC|)^2 = \left(r + \sqrt{s^2\cos^2\theta + (r + s\sin\theta)^2}\right)^2$$
$$= 2r^2 + s^2 + 2rs\sin\theta + 2r\sqrt{s^2\cos^2\theta + (r + s\sin\theta)^2}$$
$$\ge 2r^2 + 1 + 2rs\sin\theta + 2r(r + s\sin\theta)$$
$$= 8 \cdot \frac{r(r + s\sin\theta)}{2} + 1 = 8[ABC] + 1.$$
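
As a finite sanity check of the statement (not a substitute for the proof), one can enumerate all triples of distinct lattice points in a small grid. The sketch below assumes the inequality $(|AB| + |BC|)^2 < 8[ABC] + 1$ from the problem; the grid size and the helper names are arbitrary choices for illustration, and a small tolerance guards against floating-point rounding in the equality cases.

```python
import math
from itertools import product

def is_square_corner(A, B, C):
    """True if BA and BC are perpendicular and equally long, so that
    A, B, C are three vertices of a square with B the middle one."""
    ux, uy = A[0] - B[0], A[1] - B[1]
    vx, vy = C[0] - B[0], C[1] - B[1]
    return ux * vx + uy * vy == 0 and ux * ux + uy * uy == vx * vx + vy * vy

def check(limit):
    """Test every triple of distinct lattice points in [0, limit]^2."""
    pts = list(product(range(limit + 1), repeat=2))
    for A, B, C in product(pts, repeat=3):
        if len({A, B, C}) < 3:
            continue
        # Twice the area of ABC via the cross product; 8[ABC] = 4 * twice_area.
        twice_area = abs((A[0] - B[0]) * (C[1] - B[1]) - (A[1] - B[1]) * (C[0] - B[0]))
        lhs = (math.dist(A, B) + math.dist(B, C)) ** 2
        if lhs < 4 * twice_area + 1 - 1e-9 and not is_square_corner(A, B, C):
            return False
    return True

print(check(4))
```
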


B1. (112, 30, 30, 0, 0, 0, 0, 0, 3, 3, 12, 9) 


Solution 1. Let $z = x + 1/x$. Then

$$\frac{(x + 1/x)^6 - (x^6 + 1/x^6) - 2}{(x + 1/x)^3 + (x^3 + 1/x^3)} = \frac{z^6 - z^2(z^2 - 3)^2}{z^3 + z(z^2 - 3)} = z^3 - z(z^2 - 3) = 3z = 3(x + 1/x).$$

Observe that $x + 1/x - 2 = (\sqrt{x} - 1/\sqrt{x})^2 \ge 0$, with equality if and only if
x = 1, so that the minimum of our function is 6.


Solution 2. By direct expansion followed by long-division of polynomials,

$$\frac{(x + 1/x)^6 - (x^6 + 1/x^6) - 2}{(x + 1/x)^3 + (x^3 + 1/x^3)} = \frac{3}{x} \cdot \frac{2x^8 + 5x^6 + 6x^4 + 5x^2 + 2}{2x^6 + 3x^4 + 3x^2 + 2} = \frac{3}{x}(x^2 + 1) = 3(x + 1/x).$$

By the arithmetic-geometric mean inequality, $x + 1/x \ge 2\sqrt{x(1/x)} = 2$, with
equality if and only if x = 1. It follows that the minimum value is 6 and occurs
when x = 1.
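
The simplification to $3(x + 1/x)$ can be spot-checked with exact rational arithmetic. This is only an illustrative check on a handful of sample points chosen here, not a proof; the function name `f` is a placeholder for the expression in B1.

```python
from fractions import Fraction

def f(x):
    """The expression from B1, evaluated exactly for rational x > 0."""
    num = (x + 1 / x) ** 6 - (x ** 6 + 1 / x ** 6) - 2
    den = (x + 1 / x) ** 3 + (x ** 3 + 1 / x ** 3)
    return num / den

samples = [Fraction(p, q) for p in range(1, 8) for q in range(1, 8)]
identity_holds = all(f(x) == 3 * (x + 1 / x) for x in samples)
print(identity_holds, f(Fraction(1)), min(f(x) for x in samples))
```
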


B2. (82, 10, 9, 0, 0, 0, 0, 0, 3, 11, 38, 46) 


Solution. With two reflections, one about the x-axis and the other about y = x,
the shortest perimeter for such a triangle is seen to equal the distance between the
points (a, −b) and (b, a). This distance is $\sqrt{2(a^2 + b^2)}$.


B3. (44, 4, 3, 0, 0, 0, 0, 0, 7, 2, 62, 77) 


Solution. Observe that twice the desired surface area equals the surface area of
the whole sphere minus 5 spherical caps subtended by a “chord” with central angle
$2\pi/5$. The surface area of a spherical cap of angle $\alpha$ can be computed in spherical
coordinates by

$$\int_0^{\alpha/2} \int_0^{2\pi} \sin\phi \, d\theta \, d\phi = 2\pi(1 - \cos(\alpha/2)).$$

Thus, the desired area is

$$\tfrac{1}{2}\bigl(4\pi - 5(2\pi(1 - \cos(\pi/5)))\bigr) = 5\pi\cos(\pi/5) - 3\pi.$$

That is, $A = -3\pi$, $B = 5\pi$, $\alpha = \pi/2$, $\beta = \pi/5$.


B4. (42, 9, 22, 0, 0, 0, 0, 0, 21, 28, 24, 53) 
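Solution. It is necessary and sufficient that $mn/(m,n)^2$ is even, where $(m,n)$
denotes the greatest common divisor. Indeed, if $m = ac$ and $n = bc$ with $c = (m,n)$, then
$\lfloor i/ac \rfloor + \lfloor i/bc \rfloor$ depends only on $\lfloor i/c \rfloor$, so

$$\sum_{i=0}^{mn-1} (-1)^{\lfloor i/m \rfloor + \lfloor i/n \rfloor} = c \sum_{k=0}^{c-1} \sum_{i=0}^{ab-1} (-1)^{\lfloor i/a \rfloor + \lfloor i/b \rfloor + k(a+b)}.$$

If ab is odd then a + b is even, so

$$\sum_{k=0}^{c-1} (-1)^{k(a+b)} = c,$$

and the sum of ab terms, each equal to ±1, is nonzero. Conversely, if a + b is
odd, then

$$\left\lfloor \frac{i}{a} \right\rfloor + \left\lfloor \frac{i}{b} \right\rfloor + \left\lfloor \frac{ab - 1 - i}{a} \right\rfloor + \left\lfloor \frac{ab - 1 - i}{b} \right\rfloor = a + b - 2 \equiv 1 \pmod{2}.$$

Thus,

$$\sum_{i=0}^{ab-1} (-1)^{\lfloor i/a \rfloor + \lfloor i/b \rfloor} = -\sum_{i=0}^{ab-1} (-1)^{\lfloor (ab-1-i)/a \rfloor + \lfloor (ab-1-i)/b \rfloor} = -\sum_{i=0}^{ab-1} (-1)^{\lfloor i/a \rfloor + \lfloor i/b \rfloor},$$

so this sum is zero.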

B5. (55, 2, 29, 0, 0, 0, 0, 0, 8, 0, 29, 76) 


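Solution. We have

$$N = \frac{10^{1998} - 1}{9}, \qquad\text{so}\qquad \sqrt{N} = \frac{10^{999}}{3}\sqrt{1 - 10^{-1998}}.$$

Taylor’s theorem with remainder implies that

$$(1 - 10^{-1998})^{1/2} = 1 - \tfrac{1}{2} \cdot 10^{-1998} + \varepsilon, \qquad |\varepsilon| < 10^{-3996}.$$

Thus, the first digit after the decimal point in

$$10^{999}\sqrt{N} = \frac{10^{1998} - 1}{3} + \frac{1}{6} + \frac{10^{1998}\varepsilon}{3}$$

is 1, and this is the thousandth digit after the decimal point of $\sqrt{N}$.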
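
With arbitrary-precision integers the digit can also be read off directly. This is a computational cross-check rather than the argument of the solution; it relies on Python's `math.isqrt`, which returns the exact integer square root.

```python
from math import isqrt

N = (10 ** 1998 - 1) // 9          # 1998 digits, all equal to 1

# floor(sqrt(N) * 10^1000) equals isqrt(N * 10^2000); its last digit is the
# thousandth digit of sqrt(N) after the decimal point.
scaled_root = isqrt(N * 10 ** 2000)
thousandth_digit = scaled_root % 10
print(thousandth_digit)
```
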


B6. (25, 8, 8, 0, 0, 0, 0, 0, 4, 5, 31, 118)


Solution. Let $n = 4m^2$. Then

$$n^3 + an^2 + bn + c = 64m^6 + 16am^4 + 4bm^2 + c = (8m^3 + am)^2 + (4b - a^2)m^2 + c.$$

If $m \gg 0$, $|(4b - a^2)m^2 + c| < 8m^3 + am$, so $n^3 + an^2 + bn + c$ can be a square
only if $(4b - a^2)m^2 + c = 0$. This can happen for only one value of m unless
$4b - a^2 = c = 0$, and in this case $n^3 + an^2 + bn + c$ is a square if and only if n is
square.


Santa Clara University, Santa Clara, CA 95053-0290 
Santa Clara University, Santa Clara, CA 95053-0290 
St. Olaf College, Northfield, MN 55057-1098 


UNDERSTANDING THE METRIC SYSTEM 


1 million microphones = 1 megaphone
1 million bicycles = 2 megacycles
2000 mockingbirds = 2 kilomockingbirds
10 cards = 1 decacards
1/2 lavatory = 1 demijohn
1 millionth of a fish = 1 microfiche
453.6 graham crackers = 1 pound cake
10 rations = 1 decoration
10 millipedes = 1 centipede
3-1/3 tridents = 1 decadent
10 monologues = 5 dialogues
2 monograms = 1 diagram
8 nickels = 2 paradigms


Plucked from the Internet by Russ Hood, Rio Linda, CA 


NOTES


Edited by Jimmie D. Lawson and William Adkins 


On the Nelson Unit Distance 
Coloring Problem 


Carsten Thomassen 


In 1950 Nelson raised the problem of coloring the Euclidean plane in such a way 
that no two points of distance 1 receive the same color. How many colors are 
needed? This problem was often mentioned in Paul Erdős’ famous lectures on
unsolved combinatorial problems. The history of the problem is described in [2] 
and [3]. Clearly, three colors are needed. To see that four colors are needed, we 
consider seven points x,, x,,..., x, in the Euclidean plane such that the following 
pairs are of distance 1: 2X, x, X43, X%1X4, X3X4, XyX5, XX 6, AgX oy X4X7, X4X7, AUgX7, 
Xx. It follows from the theorem of de Bruijn and Erdős [1] that the number of
colors needed for the whole plane is the maximum number of colors needed for 
the finite subsets. The half-century old upper bound 7 is obtained by drawing an 
appropriate graph in the plane such that each face (region) is bounded by a cycle 
of (Euclidean) diameter less than one and then coloring each face and part of the 
boundary by the same color in such a way that only faces of distance > 1 receive 
the same color. We prove that colorings of this type always need at least 7 colors. 
More generally, 7 colors are needed for any surface and any metric of large 
diameter provided there are no short noncontractible curves and no short con- 
tractible curves whose interior have large area. 

The upper bound 7 is obtained from a hexagonal tiling of the plane such that 
the hexagons are regular and of diameter slightly less than 1. All points inside a 
hexagon are colored with the same color. Two hexagons are colored differently if 
the distance between them is less than one. A coloring of this type will be called 
nice. More generally, we consider any metric space S, d such that S is a surface,
i.e., S is an arcwise connected Hausdorff space such that each element of S has a
neighborhood homeomorphic to an open disc in the Euclidean plane. We let G be
a connected graph on S, i.e., the vertices of G are elements of S, and the edges of
G are simple arcs on S that are pairwise disjoint except at a common vertex.
Moreover, we assume that each face (i.e., arcwise connected component of S\ G) 
has diameter less than 1, is homeomorphic to a disc, and is bounded by a cycle in 
G. Now a nice coloring of S obtained from G is a coloring such that each color 
class is the union of faces (and part of their boundaries) such that the distance 
between any two of these faces is greater than 1. We define the area of a subset A
of S as the maximum number of pairwise disjoint open discs of radius 1/2 that are
contained in A. (If this maximum does not exist we say that A has infinite area.)
We say that a simple closed curve C is contractible if S\C has precisely two 
arcwise connected components such that one of them is homeomorphic to an open 
disc in the Euclidean plane. That component is called the interior of C and is 
denoted int(C). (If S is a sphere, then int(C) denotes any component of S\C of 
smallest area). We prove that every nice coloring of S needs at least 7 colors
provided there exists a natural number k such that (i), (ii), (iii) below hold.


(i) Every noncontractible simple closed curve has diameter at least 2.
(ii) If C is a simple closed curve of diameter less than 2, then the area of
int(C) is at most k.
(iii) The diameter of S is at least 12k + 30.


If any of these conditions (i), (ii), (iii) is dropped, then the number of colors
needed may decrease. Thus a thin two-way infinite cylinder has a nice 6-coloring, 
which shows that (i) cannot be omitted. Similarly, a thin one-way infinite cylinder 
(with a small disc pasted on the boundary of the cylinder to form the bottom) 
shows that (ii) cannot be omitted. Finally, (iii) cannot be omitted since any sphere 
of diameter less than 1 has a nice coloring in two colors. 

D. R. Woodall [5] (see also [4] for a correction) has obtained a 6-color theorem
related to the 7-color theorem in our Theorem 1.


2. A lemma on degrees in graphs. A graph G is a set V(G) of elements called
vertices and a set E(G) of unordered pairs xy of vertices called edges. If the edge
xy is present we say that xy joins x and y and that x and y are neighbors. The
number of neighbors of x is the degree of x. A path from x to y is a graph
consisting of distinct vertices $x_1, x_2, \ldots, x_n$ and the edges $x_1x_2, x_2x_3, \ldots, x_{n-1}x_n$
where $x_1 = x$, $x_n = y$. If we add the edge $x_nx_1$ we obtain a cycle. If x is a vertex,
then $D_1(x)$ is the set of neighbors of x. More generally, if $n \ge 2$, then $D_n(x)$ is the
set of vertices in $V(G) \setminus [\{x\} \cup D_1(x) \cup \cdots \cup D_{n-1}(x)]$ having a neighbor in
$D_{n-1}(x)$. The subgraph of G induced by $\{x\} \cup D_1(x) \cup D_2(x) \cup \cdots$ is the
connected component of G containing x. The graph G is connected if G has only one
connected component. G is locally finite if $D_1(x)$ is finite for each vertex x in G.
G is locally connected if for each vertex x, the subgraph of G induced by $D_1(x)$ is
connected. G is locally Hamiltonian if, for each x in V(G), G has a cycle with
vertex set $D_1(x)$. The graph of the icosahedron is locally Hamiltonian and has 12
vertices all of degree 5. No larger connected graph has these properties.
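
The statement about the icosahedron can be verified mechanically. The sketch below builds the icosahedron graph from one particular labeling (a top vertex, two rings of five, and a bottom vertex; this labeling is an assumption of the example, not taken from the note) and checks that the neighbors of every vertex induce a single cycle, which in particular is a cycle with vertex set $D_1(x)$.

```python
def icosahedron():
    """Adjacency sets: vertex 0 on top, 1-5 upper ring, 6-10 lower ring, 11 at bottom."""
    adj = {v: set() for v in range(12)}

    def join(u, v):
        adj[u].add(v)
        adj[v].add(u)

    for i in range(5):
        upper, lower = 1 + i, 6 + i
        join(0, upper)                      # top to upper ring
        join(11, lower)                     # bottom to lower ring
        join(upper, 1 + (i + 1) % 5)        # upper ring
        join(lower, 6 + (i + 1) % 5)        # lower ring
        join(upper, lower)                  # zig-zag band between the rings
        join(upper, 6 + (i + 1) % 5)
    return adj

def neighborhood_is_cycle(adj, x):
    """True if the subgraph induced by the neighbors of x is a single cycle."""
    nbrs = adj[x]
    if any(len(adj[y] & nbrs) != 2 for y in nbrs):
        return False
    # All degrees inside nbrs equal 2, so it is one cycle exactly when it is connected.
    start = next(iter(nbrs))
    seen, frontier = {start}, [start]
    while frontier:
        y = frontier.pop()
        for z in adj[y] & nbrs:
            if z not in seen:
                seen.add(z)
                frontier.append(z)
    return seen == nbrs

adj = icosahedron()
degrees = sorted(len(adj[v]) for v in adj)
locally_hamiltonian = all(neighborhood_is_cycle(adj, v) for v in adj)
print(degrees, locally_hamiltonian)
```
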


Lemma 1. If G is a connected, locally finite, and locally Hamiltonian graph with at 
least 13 vertices, then G has a vertex of degree at least 6. 


Proof: If no vertex has degree at least 6 we pick a vertex x of maximum degree.
Clearly x has degree at least 3. If x has degree 3, then the subgraph of G induced
by $\{x\} \cup D_1(x)$ is the graph of the tetrahedron, because G has a cycle with vertex
set $D_1(x)$. Since G has maximum degree 3, there is no vertex in $D_2(x)$. Since G is
connected, G is the graph of the tetrahedron, contrary to the assumption that G
has at least 13 vertices. If x has degree 4, then we consider a cycle in $D_1(x)$ and
conclude that each vertex y in $D_1(x)$ has at most one neighbor z in $D_2(x)$. Since
G has a cycle with vertex set $D_1(y)$, z has at least three neighbors in $D_1(x)$. So,
there are at most 4 edges from $D_1(x)$ to $D_2(x)$, and every vertex in $D_2(x)$ has at
least three neighbors in $D_1(x)$. Hence $D_2(x)$ has at most one vertex z. Since G has
a cycle with vertex set $D_1(z)$, it follows that $D_3(x) = \emptyset$. So, G has at most 6
vertices, a contradiction. We may therefore assume that x has degree 5.

Each vertex y in $D_1(x)$ has at most two neighbors in $D_2(x)$, because a cycle
with vertex set $D_1(y)$ shows that y has at least two neighbors in $D_1(x)$. Since G
has a cycle with vertex set $D_1(y)$, every neighbor z of y in $D_2(x)$ has at least two
neighbors in $D_1(x)$. Now z cannot have two or more neighbors in $D_3(x)$ because
then a cycle with vertex set $D_1(z)$ shows that z has at least two neighbors in
$D_2(x)$, that is, z has a total of at least 6 neighbors, a contradiction. So z has at
most one neighbor in $D_3(x)$ and that neighbor has at least three neighbors
in $D_2(x)$. Since there are at most 10 edges from $D_1(x)$ to $D_2(x)$, and every vertex
in $D_2(x)$ has at least two neighbors in $D_1(x)$, it follows that $D_2(x)$ has at most 5
vertices. Hence there are at most 5 edges from $D_2(x)$ to $D_3(x)$. Since each vertex
in $D_3(x)$ has at least three neighbors in $D_2(x)$, it follows that $D_3(x)$ has at most
one vertex, and $D_4(x) = \emptyset$. Hence G has at most 12 vertices, a contradiction that
completes the proof. ∎


3. A 7-color theorem 


Theorem 1. Let G be a connected graph on a surface S satisfying (i), (ii), and (iii). 
Then every nice coloring needs at least 7 colors. 


Proof: Suppose (reductio ad absurdum) that there exists a coloring using at most 6
colors. We define the map graph M = M(G, S) as the graph whose vertices are the
faces of G such that two vertices in M are neighbors if and only if the
corresponding facial cycles in G intersect. Consider any vertex x of M and let $C_x$ be
the corresponding facial cycle in G. We choose an orientation of $C_x$ and let
$x_1, x_2, \ldots, x_k, x_1$ be the vertices in $D_1(x)$ listed in the order that they are
encountered when we traverse $C_x$. We now explain the idea behind the proof.

We consider first the particularly nice case where, for each vertex x, all vertices
$x_1, x_2, \ldots, x_k$ are distinct. In that case, M is locally Hamiltonian. Since the surface
S is arcwise connected, it follows that M is connected. Since S has diameter
greater than 13, M has more than 12 vertices, and hence, by Lemma 1, M has a
vertex x of degree at least 6. Now x and its neighbors must have distinct colors
because x corresponds to a face of diameter < 1 on S. This contradiction
completes the proof in the particularly nice case where M is locally Hamiltonian.

However, a vertex may appear several times in the sequence $x_1, x_2, \ldots, x_k$
above, and some more careful analysis is needed. We omit those appearances
(except possibly one) of $x_i$ for which $C_x$ and $C_{x_i}$ have only a vertex in common. In
other words, if $x_i$ appears more than once in the new sequence, then we list only
those appearances such that $C_x$ and $C_{x_i}$ share an edge. Then any two consecutive
vertices in the sequence $x_1, x_2, \ldots, x_k, x_1$ are neighbors in M and so M is locally
connected. It follows that M − x (that is, M with x and the edges incident with x
removed) is connected. Moreover, if y is any other vertex of M, then M − x − y is
connected unless y appears twice in the sequence $x_1, x_2, \ldots, x_k$, that is, $C_x$ and $C_y$
have at least two edges in common.

Consider now two vertices x and y such that $C_x$ and $C_y$ have at least two edges
e and f in common (that is, $y = x_i = x_j$ for $1 \le i < j - 1 < k - 1$). Let R be a
simple closed curve in the faces bounded by $C_x$ and $C_y$ such that R crosses each of
e and f precisely once and has no other point in common with G. By (i), R is
contractible. Hence M − x − y is disconnected. We say that {x, y} is a 2-separator
in M. For each vertex z in M such that $C_z$ is in int(R) and has color 1, we pick a
point $P_z$ in int($C_z$). By (ii), there are at most k points $P_z$ and hence there are
altogether at most 6k vertices z such that int($C_z$) ⊆ int(R). We define int(M, x, y)
as the subgraph of M − x − y induced by all those vertices z in M such that $C_z$ is
in int(R) for some R. Then each connected component of int(M, x, y) has at most
6k vertices. Since S has diameter at least 12k + 3 it follows that G has two
vertices whose graph distance is at least 12k + 2. Hence M − x − y has some
component that is not in int(M, x, y). We claim that M − x − y has precisely one
such component, which we call ext(M, x, y). To see this, let $e_1, e_2, \ldots, e_m$ be the
edges in $C_x \cap C_y$ occurring in that cyclic order on $C_x$. Then $e_1, \ldots, e_m$ divide
$D_1(x) \setminus \{y\}$ into m classes $A_1, A_2, \ldots, A_m$. By letting $\{e, f\} = \{e_i, e_{i+1}\}$ ($1 \le i \le m$)
in the preceding argument, we conclude that for each i = 1, 2, ..., m, either
$A_i \subseteq \mathrm{int}(M, x, y)$ or $A_i \cap \mathrm{int}(M, x, y) = \emptyset$. Since the former cannot hold for each
i ∈ {1, 2, ..., m}, the latter must hold for some i, and hence the former holds for
all other i in {1, 2, ..., m}. Summarizing, for any two vertices x, y in M, M − x − y
has precisely one connected component ext(M, x, y) with more than 6k vertices.

If {u, v} is a 2-separator in M such that either x or y or both is in int(M, u, v),
then clearly int(M, x, y) ⊆ int(M, u, v). (To see this, we use the properties of M
established previously and forget about S.) If no such 2-separator {u, v} exists, then
we say that {x, y} is a maximal 2-separator and that xy is a crucial edge. Since each
connected component of int(M, x, y) has at most 6k vertices, then a maximal
2-separator exists (provided a 2-separator exists). Let H be the subgraph of M
obtained by deleting int(M, x, y) for each maximal 2-separator {x, y}. Then H ≠ ∅.
Moreover, H is connected since a shortest path in M between two vertices in H
never uses vertices in int(M, x, y). Similarly, H is locally connected. We claim that
H is locally Hamiltonian. Consider again a vertex x in H and the sequence
$x_1, x_2, \ldots, x_k, x_1$ in $D_1(x)$ (taken in M). If this sequence forms a Hamiltonian
cycle in $D_1(x)$ in H, we have finished. By the definition of H, $k \ge 3$. So assume
that $x_i = x_j$ where $1 \le i < j - 1 < k - 1$. Then $\{x, x_i\}$ is a 2-separator and
the notation can be chosen such that $\mathrm{int}(M, x, x_i)$ contains all the vertices
$x_{i+1}, x_{i+2}, \ldots, x_{j-1}$. We repeat this argument for each pair i, j such that $x_i = x_j$
where $1 \le i < j - 1 < k - 1$. Then the vertices in $x_1, x_2, \ldots, x_k, x_1$ that remain
after we delete all vertices in the interiors of the 2-separators form a cyclic
sequence with no repetitions. As H is connected and locally connected and has at
least three vertices (by (iii)), the preceding reduced cyclic sequence has at least two
distinct vertices. It cannot have precisely two vertices u, v because then H − u − v
is disconnected, and hence M − u − v is disconnected (because M is obtained
from H by “pasting graphs on edges of H”). Since one of the edges xu or xv is
crucial (because $D_1(x)$ is smaller in H than in M), the maximality property of the
2-separator {x, u} or {x, v} implies that ext(M, u, v) is the connected component of
M − u − v containing x. For each vertex z in that component, M has a path of
length at most 6k from z to either x, u, or v. Hence M has diameter at most
12k + 1, a contradiction that proves that H is locally Hamiltonian.

If H has a vertex x of degree at least 6 we have finished because x and its
neighbors must have different colors in the nice coloring. So, we may assume that
each vertex of H has degree at most 5. By Lemma 1, H has at most 12 vertices.
Hence H has at most 30 edges. Since M is obtained from H by “pasting”
int(M, x, y) on the crucial edge xy for each crucial edge of H, we conclude that
the diameter of M is at most 12k + 29, a contradiction to (iii). ∎


Descartes’ Rule of Signs:
Another Construction


David J. Grabiner 


Descartes’ Rule of Signs is a simple, classical bound on the number of positive 
roots of a polynomial, and analogously on the number of negative roots. In 
Descartes’ own words, the rule is stated as follows: [3] 


An equation can have as many true [positive] roots as it has changes of sign, 
from + to — or from — to +. 


This statement is written in terms of sign changes of the coefficients, but the 
wording is very similar to the Intermediate Value Theorem, which says that a 
continuous function must have at least one root in an interval if it changes sign in 
that interval. This suggests a natural construction of the polynomial which achieves 
the bound by creating a correspondence between sign changes of its coefficients 
and sign changes of its value at designated points. 

Other constructions that achieve the maximum number of roots are known [2]. 
In this MONTHLY, Anderson, Jackson, and Sitharam [1] give a different natural 
construction, which works as long as all signs are nonzero. They construct a 
polynomial by choosing the roots according to the desired signs; they then show 
that the coefficients of the polynomial have the correct signs. They also show that 
this polynomial can be modified to obtain a polynomial with the same sign 
sequence and any number of positive roots that is a positive even integer less than 
the maximum. 

For our construction, we need only a single polynomial without signs attached 
to the coefficients; however we specify the signs, the value of the polynomial has 
the correct signs at the proper places. 


Theorem 1. Let $\sigma_0, \ldots, \sigma_n$ be any sequence of −1, 0, and +1. Then for any
$k > n$, the polynomial

$$p(x) = \sum_{j=0}^{n} \sigma_j k^{-j^2} x^j$$

has the maximum number of positive and negative roots allowed by Descartes’ Rule
of Signs.


Proof: If $\sigma_i \ne 0$, then at $x = k^{2j}$, the absolute value of the term of $x^i$ is $k^{2ij - i^2}$.
Thus, at $x = k^{2j}$, the absolute value of the term of $x^j$ is $k^{j^2}$ and the absolute
values of the other terms are all at most $k^{j^2 - 1}$. Since there are only n such terms
and $k > n$, the term of $x^j$ is larger in absolute value than all the others combined,
and thus the sign of $p(k^{2j})$ is the same as $\sigma_j$. Thus, if $\sigma_i$ and $\sigma_j$ are two
consecutive opposite signs, the polynomial must have a root between $x = k^{2i}$ and
$x = k^{2j}$.

Analogously, for negative roots, the term of $x^j$ is larger in absolute value than
all the other terms at $x = -k^{2j}$, and thus if $(-1)^i\sigma_i$ and $(-1)^j\sigma_j$ are two
consecutive opposite signs in p(−x), the polynomial must have a root between
$x = -k^{2j}$ and $x = -k^{2i}$. ∎
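
The dominance argument can be tested with exact arithmetic. The sketch below assumes the polynomial has the form $p(x) = \sum_j \sigma_j k^{-j^2} x^j$, which is what the term sizes in the proof indicate; the sign sequence, the choice k = n + 1, and the helper names are arbitrary illustrations.

```python
from fractions import Fraction

def build_poly(signs, k):
    """Coefficients of p(x) = sum_j signs[j] * k**(-j*j) * x**j, lowest degree first."""
    return [Fraction(s, k ** (j * j)) for j, s in enumerate(signs)]

def evaluate(coeffs, x):
    """Horner evaluation with exact rational arithmetic."""
    total = Fraction(0)
    for c in reversed(coeffs):
        total = total * x + c
    return total

def sign(v):
    return (v > 0) - (v < 0)

signs = [1, -1, 0, 1, 1, -1]       # sigma_0, ..., sigma_5 (an arbitrary example)
n = len(signs) - 1
k = n + 1                          # any integer k > n satisfies the hypothesis
p = build_poly(signs, k)

# At x = k^(2j) the x^j term dominates, so p takes the sign sigma_j there;
# at x = -k^(2j) it takes the sign (-1)^j * sigma_j.
positive_ok = all(sign(evaluate(p, Fraction(k) ** (2 * j))) == s
                  for j, s in enumerate(signs) if s != 0)
negative_ok = all(sign(evaluate(p, -(Fraction(k) ** (2 * j)))) == (-1) ** j * s
                  for j, s in enumerate(signs) if s != 0)
print(positive_ok, negative_ok)
```

Each sign change between consecutive nonzero entries then forces a root in the corresponding interval by the Intermediate Value Theorem.
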


Since this theorem is valid for polynomials with zero coefficients, it is also easy 
to construct a polynomial with fewer roots than the maximum by making some of 
the coefficients very small and using the remaining signed coefficients to force the 
roots. In particular, we can make the number of positive roots short of the 
maximum by any even number. 


Theorem 2. Let $\sigma_0, \ldots, \sigma_n$ be any sequence of −1, 0, and +1; we may assume
$\sigma_0 \ne 0$ and $\sigma_n \ne 0$ since the nonzero roots are not affected by eliminating zeros at the
ends of the sequence. Let $\tau_0, \ldots, \tau_n$ be obtained by changing some of the internal $\sigma_i$
to zero, but keeping $\tau_0$ and $\tau_n$ nonzero. Then there is a polynomial p(x) with sign
sequence $\sigma$ that has as many positive roots as there are sign changes in the $\tau$ sequence
and as many negative roots as there are sign changes in the $(-1)^i\tau_i$ sequence.


Corollary. If the $\sigma$ sequence has at least 2r sign changes, we can take the $\tau$ sequence
by changing to zeros those signs that are opposite to $\sigma_0$ and precede the 2r-th sign
change. This gives a polynomial with 2r fewer positive roots than the number of sign
changes in the $\sigma$ sequence.


Proof: Let q(x) be the polynomial for the $\tau$ sequence; let its roots be $x_1, \ldots, x_m$.
The polynomial q(x) has zero coefficients where certain signed coefficients are
needed. We replace these zeros in q(x) by sufficiently small terms $\sigma_i \delta x^i$ in p(x)
without affecting the number or signs of roots.

Since q(x) is known to have its roots all in distinct intervals, it cannot have
any double roots. Thus there is some $\epsilon_1$ such that no root of q′(x) is within $\epsilon_1$
of a root of q(x); we also require $\epsilon_1 < |x_i|$ for all i, which we can require
since $q(0) = \tau_0 \ne 0$. Since |q′(x)| is continuous, it is bounded on each interval
$[x_i - \epsilon_1, x_i + \epsilon_1]$; let $\epsilon_2$ be its minimum value on the union of all of these
intervals. Finally, let $\epsilon_3$ be the minimum of $\epsilon_1\epsilon_2$ and the value of |q(x)| on the
intervals $[-k^{2n}, x_1 - \epsilon_1]$, $[x_{i-1} + \epsilon_1, x_i - \epsilon_1]$, and $[x_m + \epsilon_1, k^{2n}]$.

Then if we add to q(x) any differentiable function f(x) that has $|f(x)| < \epsilon_3$
and $|f'(x)| < \epsilon_2$ on $[-k^{2n}, k^{2n}]$, then f(x) + q(x) must still have one simple root
in each interval $[x_i - \epsilon_1, x_i + \epsilon_1]$, since it changes sign in each such interval and its
derivative does not change sign there. Also, f(x) + q(x) cannot have any other
root in $[-k^{2n}, k^{2n}]$, since $|f(x)| < \epsilon_3 \le |q(x)|$ outside these intervals. Thus f(x)
+ q(x) has the same number of roots as q(x) in $[-k^{2n}, k^{2n}]$, with the same signs.

By the construction of q(x) with leading term $\pm k^{-n^2} x^n$ and other terms whose
absolute values are all at most $k^{-n^2-1}|x|^n$ for $|x| > k^{2n}$, we have
$|q(x)| \ge (k - n)k^{-n^2-1}|x|^n$ for all $|x| > k^{2n}$. Thus if $|f(x)| < (k - n)k^{-n^2-1}|x|^n$
for all $|x| > k^{2n}$, then f(x) + q(x) has no roots with absolute value greater than
$k^{2n}$, and thus has the same number of positive and negative roots as q.

We can thus let p(x) = f(x) + q(x), where

$$f(x) = \sum_{j=1}^{n-1} (\sigma_j - \tau_j)\,\delta x^j,$$

with $\delta$ sufficiently small to meet the conditions on f. This polynomial has the
correct signs and the correct number of positive and negative roots. ∎


Note that this technique does not allow us to obtain simultaneously all possible
numbers of positive and negative roots. In fact, this turns out to be impossible;
some combinations of positive and negative roots cannot be obtained at all. The
simplest impossible case is the sequence +, 0, −, 0, +, corresponding to the
polynomial $ax^4 - bx^2 + c = 0$, which has −x as a root if it has x as a root and
thus cannot simultaneously have no positive and two negative roots.

Even if no signs are zero, it may not be possible to obtain simultaneously all
admissible numbers of positive and negative roots. For example, the sequence
+, −, −, −, + has two positive and two negative sign changes. It is possible for a
polynomial with this sign sequence to have two negative or zero positive real roots,
but not both simultaneously. A fourth-degree polynomial with only two negative
real roots for which the sum of the roots was positive could be factored as
$a(x^2 + bx + c)(x^2 - sx + t)$ with $a, b, c, s, t > 0$, $s^2 < 4t$, and $b^2 > 4c$. The product
of these factors is $a(x^4 + (b - s)x^3 + (t + c - bs)x^2 + (bt - cs)x + ct)$. To
get the correct sign sequence, we need $b < s$ and $bt < cs$, which gives $b^2t < s^2c$
and thus $b^2/c < s^2/t$. But we have $b^2/c > 4 > s^2/t$.

This counterexample provides a negative answer to the question raised in [1] 
whether it is possible to get a polynomial with an arbitrary sign sequence and any 
simultaneous numbers of positive and negative roots allowed by Descartes’ Rule of 
Signs. This suggests a new conjecture: the only possible numbers of positive and 
negative roots are the maximum values permitted by Descartes’ Rule of Signs 
in a sequence obtained by changing some of the internal signs to zeros as in
Theorem 2. The above cases and the analogous +, +, -, +, + confirm the
conjecture for degree 4. 

Integrating Polynomials in Secant
and Tangent 


Jonathan P. McCammond 


This note provides a relatively painless way to integrate arbitrary polynomials in 
secant and tangent without invoking integration by parts or anything beyond 
elementary polynomial and trigonometric identities. The techniques involved also 
introduce some of the ideas behind the construction of Laurent polynomials, 
although the manner in which they do so is rather indirect. We begin with a 
theorem that covers almost all possibilities. 


Theorem 1. For each polynomial P(s,t) in two variables, there are polynomials F 
and G in one variable and a constant c such that 


$$\int P(\sec x, \tan x)\,\sec x\,dx = F(u) - G(v) + c\ln|u| + C$$


where $u = \sec x + \tan x$ and $v = \sec x - \tan x$.

Proof: Once we define $u = \sec x + \tan x$ and $v = \sec x - \tan x$, it is easy to
check that $\sec x = (u + v)/2$, $\tan x = (u - v)/2$, and $uv = \sec^2 x - \tan^2 x = 1$.
(The traditional construction of Laurent polynomials involves quotienting the
two-variable polynomial ring by the two-sided principal ideal generated by the
polynomial $xy - 1$. Since $uv = 1$, the polynomials in secant and tangent can be
viewed as Laurent polynomials in the single variable $u = \sec x + \tan x$. This is an
abstract algebra explanation for the simplifications that follow.) If we replace $\sec x$
and $\tan x$ with their equivalents in terms of $u$ and $v$, then the polynomial $P$ in
$\sec x$ and $\tan x$ becomes a polynomial in $u$ and $v$ instead. Moreover, since $uv = 1$,
we can replace any instance of $uv$ by 1, quickly reducing any monomial containing
both variables to one that contains at most a single variable. In other
words, the resulting polynomial in $u$ and $v$ can always be written in the form
$f(u)u + g(v)v + c$ where $f$ and $g$ are polynomials of a single variable and $c$ is a
constant.

Next consider the differentials $du$ and $dv$. Since $du = (\sec x \tan x + \sec^2 x)\,dx$
and $dv = (\sec x \tan x - \sec^2 x)\,dx$, we find that $\sec x\,dx = (1/u)\,du = -(1/v)\,dv$.
We are now ready to calculate the original integral.


$$\begin{aligned}
\int P(\sec x, \tan x)\,\sec x\,dx &= \int \bigl(f(u)u + g(v)v + c\bigr)\sec x\,dx \\
&= \int f(u)\,du - \int g(v)\,dv + c\int \frac{1}{u}\,du \\
&= F(u) - G(v) + c\ln|u| + C
\end{aligned}$$


where $F(u)$ and $G(v)$ represent the antiderivatives of the polynomials $f(u)$ and
$g(v)$, respectively.
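
The reduction in the proof can be sketched in code. The following is a minimal illustration, not part of the original note: it stores a Laurent polynomial in $u$ as a dictionary from exponents to exact coefficients (so $v = u^{-1}$), and the names `reduce_polynomial` and `antiderivative` are hypothetical helpers chosen for this sketch.

```python
from collections import defaultdict
from fractions import Fraction


def laurent_mul(p, q):
    """Multiply two Laurent polynomials in u, stored as {exponent: coefficient}."""
    out = defaultdict(Fraction)
    for i, a in p.items():
        for j, b in q.items():
            out[i + j] += a * b
    return {k: c for k, c in out.items() if c != 0}


def laurent_pow(p, n):
    """Raise a Laurent polynomial to a nonnegative integer power."""
    result = {0: Fraction(1)}
    for _ in range(n):
        result = laurent_mul(result, p)
    return result


# sec x = (u + v)/2 and tan x = (u - v)/2, where v = 1/u.
SEC = {1: Fraction(1, 2), -1: Fraction(1, 2)}
TAN = {1: Fraction(1, 2), -1: Fraction(-1, 2)}


def reduce_polynomial(P):
    """P maps (i, j) to the coefficient of s^i t^j.

    Returns the Laurent polynomial in u obtained by substituting
    s = sec x, t = tan x and simplifying with uv = 1.
    """
    total = defaultdict(Fraction)
    for (i, j), coeff in P.items():
        term = laurent_mul(laurent_pow(SEC, i), laurent_pow(TAN, j))
        for k, c in term.items():
            total[k] += Fraction(coeff) * c
    return {k: c for k, c in total.items() if c != 0}


def antiderivative(P):
    """Return (F, G, c) such that the integral of P(sec x, tan x) sec x dx
    equals F(u) - G(v) + c ln|u| + C, with F and G as {power: coefficient}."""
    L = reduce_polynomial(P)
    F = {k: c / k for k, c in L.items() if k > 0}      # from sec x dx = du/u
    G = {-k: c / (-k) for k, c in L.items() if k < 0}  # from sec x dx = -dv/v
    return F, G, L.get(0, Fraction(0))
```

For $P(s,t) = 16s^4$, passed as `{(4, 0): 16}`, the sketch returns $F(u) = u^4/4 + 2u^2$, $G(v) = 2v^2 + v^4/4$, and $c = 6$.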


Example 2. Consider the integral $\int 16\sec^5 x\,dx$. We find that
$P(s,t) = 16s^4$ and

$$\begin{aligned}
P\left(\frac{u+v}{2}, \frac{u-v}{2}\right) &= 16\left(\frac{u+v}{2}\right)^4 = (u+v)^4 \\
&= u^4 + 4u^3v + 6u^2v^2 + 4uv^3 + v^4 \\
&= u^4 + 4u^2 + 6 + 4v^2 + v^4.
\end{aligned}$$

Thus $f(u) = u^3 + 4u$, $g(v) = 4v + v^3$, and $c = 6$; we immediately conclude that

$$\int 16\sec^5 x\,dx = \frac{u^4}{4} + 2u^2 - 2v^2 - \frac{v^4}{4} + 6\ln|u| + C$$

where $u = \sec x + \tan x$ and $v = \sec x - \tan x$. The standard approach would
involve performing integration by parts twice in order to reduce the exponent of
the integrand.
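
As a sanity check on Example 2, one can differentiate the closed form numerically and compare it with the integrand $16\sec^5 x$. This is only a sketch; the step size and the sample points are arbitrary choices, and the function names are made up for the illustration.

```python
import math


def antiderivative(x):
    """Closed form from Example 2 for the integral of 16 sec^5 x."""
    u = 1 / math.cos(x) + math.tan(x)
    v = 1 / math.cos(x) - math.tan(x)
    return u**4 / 4 + 2 * u**2 - 2 * v**2 - v**4 / 4 + 6 * math.log(abs(u))


def integrand(x):
    return 16 / math.cos(x) ** 5


def central_difference(func, x, h=1e-5):
    """Second-order numerical derivative of func at x."""
    return (func(x + h) - func(x - h)) / (2 * h)


def max_relative_error(points):
    """Largest relative gap between the numerical derivative and the integrand."""
    return max(
        abs(central_difference(antiderivative, x) - integrand(x)) / integrand(x)
        for x in points
    )
```

Calling `max_relative_error` on a few points inside $(-\pi/2, \pi/2)$ should give a value close to zero if the closed form is right.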

Theorem 1 is nearly comprehensive in the sense that the only monomials in
secant and tangent that are not covered are the constant term and those of the
form $\tan^n x$. Closed forms for these integrals exist [1], but a little bit of trigonometry
brings them within the reach of Theorem 1. Consider, for example, the
monomial $\tan^7 x$. By repeatedly applying the identity $\tan^2 x = \sec^2 x - 1$ we see
that


$$\begin{aligned}
\tan^7 x &= \tan^5 x \sec^2 x - \tan^5 x \\
&= \tan^5 x \sec^2 x - \tan^3 x \sec^2 x + \tan^3 x \\
&= \tan^5 x \sec^2 x - \tan^3 x \sec^2 x + \tan x \sec^2 x - \tan x.
\end{aligned}$$


In the final expression, the first three terms can be integrated using Theorem 1, so 
that only tan x remains. More generally, given any polynomial in secant and 
tangent the pure powers of tangent can be modified in this way so that the result is 
the sum of a constant term, a constant multiple of tan x, and a polynomial to 
which Theorem 1 can be applied. This completes the proof of the following result. 
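
The repeated substitution described above is mechanical, and a short sketch makes the pattern explicit. The function `reduce_tangent_power` is a hypothetical helper, not something from the note; it returns the signed terms $\pm\tan^m x \sec^2 x$ that Theorem 1 handles, together with the leftover term, which is $\pm\tan x$ for odd $n$ and $\pm 1$ for even $n$.

```python
import math


def reduce_tangent_power(n):
    """Rewrite tan^n x (n >= 1) using tan^2 x = sec^2 x - 1.

    Returns (terms, remainder). Each entry (sign, m) of terms stands for
    sign * tan^m x * sec^2 x; remainder (sign, r) stands for sign * tan^r x
    with r equal to 0 or 1.
    """
    terms, sign = [], 1
    while n >= 2:
        # tan^n x = tan^(n-2) x sec^2 x - tan^(n-2) x
        terms.append((sign, n - 2))
        sign, n = -sign, n - 2
    return terms, (sign, n)


def evaluate(terms, remainder, x):
    """Evaluate the rewritten expression numerically at x."""
    t, sec2 = math.tan(x), 1 / math.cos(x) ** 2
    return sum(sg * t**m * sec2 for sg, m in terms) + remainder[0] * t ** remainder[1]
```

For $n = 7$ this reproduces the three $\sec^2 x$ terms and the remainder $-\tan x$ shown in the text.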


Theorem 3. For each polynomial P(s, t) in two variables, there are polynomials F 
and G in one variable and constants a, b, and c such that 


$$\int P(\sec x, \tan x)\,dx = F(u) - G(v) + a\ln|u| - b\ln|\cos x| + cx + C$$

where $u = \sec x + \tan x$ and $v = \sec x - \tan x$.

Edited by Abe Shenitzer


Mathematics and Statistics, York University, Toronto, Ontario M3J 1P3, Canada 


Field Theory: From Equations 
to Axiomatization 
Part II 


Israel Kleiner 


7. THE ABSTRACT DEFINITION OF A FIELD. The developments we have been 
describing thus far lasted close to a century. They gave rise to important “concrete” 
theories—Galois theory, algebraic number theory, algebraic geometry—in which 
the (at times implicit) field concept played a central role. At the end of the 19th 
century abstraction and axiomatics were “in the air.” For example, Pasch (1882) 
gave axioms for projective geometry, stressing for the first time the importance of 
undefined notions, Cantor (1883) defined the real numbers essentially as equiva- 
lence classes of Cauchy sequences of rationals, and Peano (1889) gave his axioms 
for the natural numbers. In algebra, von Dyck (1882) gave an abstract definition of 
a group that encompassed both finite and infinite groups (about thirty years earlier 
Cayley had defined a finite group), and Peano (1888) gave a definition of a 
finite-dimensional vector space, though this was largely ignored by his contempo- 
raries. The time was propitious for the abstract field concept to emerge. Emerge it 
did in 1893 in the hands of Weber (of Dedekind-Weber fame). 

Weber’s definition of a field appeared in his 1893 paper “Die allgemeinen 
Grundlagen der Galois’schen Gleichungstheorie” [15], in which he aimed to give 
an abstract formulation of Galois theory [8, p. 136]: 


In the following an attempt is made to present the Galois theory of algebraic 
equations in a way which will include equally well all cases in which this 
theory might be used. Thus we present it here as a direct consequence of the 
group concept illuminated by the field concept, as a formal structure com- 
pletely without reference to any numerical interpretation of the elements 
used. 


Weber’s presentation of Galois theory is indeed very close to the way the subject is 
taught today. His definition of a field, preceded by that of a group, is as follows 
[15, pp. 526-527]: 


A group becomes a field if two types of composition are possible in it, the 
first of which may be called addition, the second multiplication. The general 
determination must be somewhat restricted, however. 

1. We assume that both types of composition are commutative. 

2. Addition shall generally satisfy the conditions which define a group. 

3. Multiplication is such that
a(-b) = -(ab)
a(b + c) = ab + ac
ab = ac implies b = c, unless a = 0
Given b and c, ab = c determines a, unless b = 0.


Although the associative law under multiplication is missing, and the axioms are 
not independent, they are of course very much in the modern spirit. As examples 
of his newly defined concept Weber included the number fields and function fields 
of algebraic number theory and algebraic geometry, respectively, but also Galois’s 
finite fields and Kronecker’s “congruence fields” K[x]/(p(x)), K a field, p(x) 
irreducible over K. 

Weber proved (often reproved, after Dedekind) various theorems about fields, 
which later became useful in Artin’s formulation of Galois theory, and which are 
today recognized as basic results of the theory. Among them are [8], [10]: 


(i) Every finite algebraic extension of a field is simple (that is, it is generated 
by a single element). 
(ii) Every polynomial over a field has a splitting field. 
(iii) If $F \subseteq F(a) \subseteq F(b)$, then $(F(a) : F)$ divides $(F(b) : F)$, where for fields $K$
and $E$ with $E \subseteq K$, $(K : E)$ denotes the dimension of $K$ as a vector space
over $E$.


It should be emphasized that it was not Weber’s aim to study fields as such, but 
rather to develop enough of field theory to give an abstract formulation of Galois 
theory [11]. In this he succeeded admirably. His paper, and somewhat later his 
two-volume Lehrbuch der Algebra, exerted considerable influence on the develop- 
ment of abstract algebra [3]. 


8. HENSEL’S P-ADIC NUMBERS. In an 1899 article entitled “New foundations
of the theory of algebraic numbers,” Hensel began a life-long study of p-adic 
numbers. Inspired by the work of Dedekind-Weber, Hensel took as his point of 
departure the analogy between function fields and number fields. Just as power 
series are useful for a study of the former, Hensel introduced p-adic numbers to 
aid in the study of the latter [10, II, p. 19]: 


The analogy between the results of the theory of algebraic functions of one 
variable and those of the theory of algebraic numbers suggested to me many 
years ago the idea of replacing the decomposition of algebraic numbers, with 
the help of ideal prime factors, by a more convenient procedure that fully 
corresponds to the expansion of an algebraic function in power series in the 
neighborhood of an arbitrary point. 


Indeed, in the neighborhood of a given point a every algebraic function of a 
complex variable can be represented as an infinite series of integral and rational 
powers of $z - a$, as Weierstrass had shown. The elements of Hensel’s field of
p-adic numbers are formal power series $\sum_{i=n}^{\infty} a_i p^i$, where $a_i \in Z_p$ and $n \in Z$. And
just as every element of an algebraic function field can be identified with the set of 
its expansions at all points of the Riemann surface on which it is defined, so every 
element of an algebraic number field is identified with the set of its representations
in the field of p-adic numbers $\sum a_i p^i$ for every prime $p$ [2, p. 111].

In a 1907 book, Hensel introduced topological notions in his p-adic fields and
applied the resulting p-adic analysis in algebraic number theory. The p-adic 
numbers proved extremely useful not only there but also in algebraic geometry 
[4], [7]. They were also influential in motivating the abstract study of rings and 
fields [3]. 


9. STEINITZ. The last major event in the evolution of field theory that we 
describe is Steinitz’s great work of 1910 [13]. But first some background. 

Algebra in the 19th century was by our standards concrete. It was connected in 
one way or another with the real or complex numbers. For example, some of the 
great contributors to 19th-century algebra, mathematicians whose ideas shaped the 
algebra of the 20th century, were Gauss, Galois, Jordan, Kronecker, Dedekind, 
and Hilbert, and their algebraic work dealt with quadratic forms, cyclotomy, 
permutation groups, ideals in rings of algebraic number fields and algebraic 
function fields, and invariant theory. All of these subjects were related in one way 
or another to the real or complex numbers. 

At the turn of the 20th century the axiomatic method began to take hold as an 
important mathematical tool. Hilbert’s Foundations of Geometry of 1899 was very 
influential in this respect (see also our Section 7). Noteworthy also is the American 
school of axiomatic analysis, as exemplified in the works of Dickson, Huntington, 
E. H. Moore, and Veblen. In the first decade of the 20th century these mathemati- 
cians began to examine various axiom systems for groups, fields, associative 
algebras, projective geometry, and the algebra of logic. Their principal aim was to 
study the independence, consistency, and completeness of the axioms defining any 
one of these systems. Also relevant were Hilbert’s axiomatic characterization in 
1900 of the field of real numbers and Huntington’s like characterization in 1905 of 
the field of complex numbers [1], [3]. 

Steinitz’s groundbreaking 150-page paper “Algebraische Theorie der Korper” of 
1910 initiated the abstract study of fields as an independent subject [13]. While 
Weber defined fields abstractly, Steinitz studied them abstractly. 

Steinitz’s immediate source of inspiration was Hensel’s p-adic numbers [3,
p. 194]:


I was led into this general research especially by Hensel’s Theory of Algebraic 
Numbers, whose starting point is the field of p-adic numbers, a field which 
counts neither as a field of functions nor as a field of numbers in the usual 
sense of the word. 


More generally, Steinitz’s work arose out of a desire to delineate the abstract 
notions common to the various contemporary theories of fields: fields in algebraic 
number theory, in algebraic geometry, and in Galois theory, p-adic fields, and 
finite fields. His goal was a comprehensive study of all fields, starting from the 
field axioms [3, p. 195]: 


The aim of the present work is to advance an overview of all the possible 
types of fields and to establish the basic elements of their interrelations. 


Quite a task! Steinitz’s plan was to start from the simplest fields and to build up all
fields from these. The basic concept that he identified to study the former is the
characteristic of the field. Here are several of his fundamental results, nowadays
staples of field theory [10], [13]:


(i) Classification of fields into those of characteristic zero and those of
characteristic p. The prime fields, the “simplest” fields, are $Q$ and $Z_p$;
one or the other is a subfield of every field.

(ii) Development of a theory of transcendental extensions, which became indispensable
in algebraic geometry.

(iii) Recognition that it is precisely the finite, normal, separable extensions to 
which Galois theory applies. 

(iv) Proof of the existence and uniqueness (up to isomorphism) of the algebraic 
closure of any field. 


A description of all fields followed [11, p. 754]: 


Starting with an arbitrary prime field, by taking an arbitrary, purely transcen- 
dental extension followed by an arbitrary algebraic extension, we have a 
method of arriving at any field. 


The notions of transcendency base and degree of transcendence of an extension field, 
both of which Steinitz introduced, played a crucial role here. Also important was 
the well-ordering principle (or, equivalently, the axiom of choice), whose use he 
acknowledged [10, II, p. 20]: 


Many mathematicians continue to reject the axiom of choice. The growing 
realization that there are questions in mathematics that cannot be decided 
without this principle is likely to result in the gradual disappearance of the 
resistance to it. 


Steinitz’s work was very influential in the development of abstract algebra in the 
1920s and 1930s, as the following testimonials prove: 


Steinitz’s paper was the basis for all [algebraic] investigations in the school of 
Emmy Noether (van der Waerden, [14, p. 162]). 


[Steinitz’s work]...is not only a landmark in the development of algebra, but 
also...an excellent, in fact indispensable, introduction to a serious study of 
the new [modern] algebra (Baer and Hasse, [13, Preface]). 


Steinitz’s work marks a methodological turning-point in algebra leading to
... ‘modern’ or abstract algebra (Purkert and Wussing, [11, p. 754]).


[Steinitz’s work] can be considered as having given birth to the actual concept 
of Algebra (Bourbaki, [2, p. 83]). 


10. A GLANCE AHEAD. We now list several major developments in field theory 
and related areas in the decades following Steinitz’s fundamental work. 

(a) Valuation theory. In 1913 Kürschák abstracted Hensel’s ideas on p-adic fields
by introducing the notion of a valuation field. He proved the existence of the 
completion of a field with respect to a valuation. In 1918 Ostrowski determined all 
valuations of the field Q of rational numbers. Valuation theory, which “forms a 
solid link between number theory, algebra, and analysis” [7, vol. II, p. 537], played 
fundamental roles in both algebraic number theory and algebraic geometry; see [2], 
[4], [7], [14]. 

(b) Formally real fields. In 1927 Artin and Schreier defined the notion of a 
formally real field, namely a field in which —1 is not a sum of squares. “One of 
[the] remarkable results [of the Artin-Schreier theory] is no doubt the discovery
that the existence of an order relation on a field is linked to purely algebraic
properties of the field” [2, p. 92]: A field can be ordered if and only if it is formally
real. The theory of formally real fields enabled Artin in the same year to solve 
Hilbert’s 17th Problem on the resolution of positive definite rational functions into 
sums of squares [7, vol. I, p. 640]. 

(c) Class field theory. This is the study of finite extensions of an algebraic number 
field having an abelian Galois group. It is a beautiful synthesis of algebraic, 
number-theoretic, and analytic ideas, in which Artin’s Reciprocity Law has a central 
place. Major strides were already made by Hilbert in his “Zahlbericht” (Report on
Number Theory) of 1897. More modern aspects of the theory were developed by
Artin, Chevalley, Hasse, Takagi, and others; see [5].

(d) Galois theory. Artin set out his now-famous abstract formulation of Galois 
theory in lectures given in 1926 (but published only in 1938). In a 1950 talk he said 
[8, p. 144]: 


Since my mathematical youth I have been under the spell of the classical 
theory of Galois. This charm has forced me to return to it again and again, 
and try to find new ways to prove its fundamental theorems. 


Extensions of the classical theory were given in various directions. For example, in 
1927 Krull developed a Galois theory of infinite field extensions, establishing a 
one-one correspondence between subfields and “closed” subgroups, and thereby 
introducing topological notions into the theory. There is also a Galois theory for 
inseparable field extensions, in which the notion of derivation of a field plays a 
central role, and a Galois theory for division rings, developed independently by 
H. Cartan and Jacobson in the 1940s; see [7], [16]. 

(e) Finite fields. Finite field theory is a thriving subject of investigation in its own 
right, but it also has important uses in number theory, coding theory, geometry, 
and combinatorics; see [6], [9]. 

larsky, Richard Stong, Charles Vanden Eynden, and William E. Watkins.


Proposed problems and solutions should be sent in duplicate to the MONTHLY
problems address on the inside front cover. Submitted problems should include
solutions and relevant references. Submitted solutions should arrive at that address
before April 30, 2000. Additional information, such as generalizations and references,
is welcome. The problem number and the solver’s name and address should
appear on each solution. An acknowledgement will be sent only if a mailing label
is provided. An asterisk (*) after the number of a problem or a part of a problem
indicates that no solution is currently available.


PROBLEMS 


10760. Proposed by Bruce Reznick, University of Illinois, Urbana, IL. A function $f: \mathbb{N} \to \mathbb{C}$
is completely multiplicative if $f(1) = 1$ and $f(mn) = f(m) f(n)$ for all positive integers
$m$ and $n$. Find all completely multiplicative functions $f$ with the property that the function
$F(n) = \sum_{k=1}^{n} f(k)$ is also completely multiplicative.

10761. Proposed by Fred Galvin, University of Kansas, Lawrence, KS. Let $G$ be a graph
with $n$ vertices. For each vertex $v$, let $f(v)$ be the maximum cardinality of an independent
set of neighbors of $v$. Show that $\sum f(v) \le n^2/2$, where the sum is taken over all vertices
of $G$.


10762. Proposed by Leroy Quet, Denver, CO. Let $x_1 = 1$, and for $m \ge 1$ let $x_{m+1} =
(m + 3/2)^{-1} \sum_{k=1}^{m} x_k x_{m+1-k}$. Evaluate $\lim_{m \to \infty} x_m / x_{m+1}$.

10763. Proposed by Jean Anglesio, Garches, France. Let $ABC$ be a triangle; let $O$ be its
circumcenter, $H$ its orthocenter, $I$ its incenter, $N$ its Nagel point, and $X$, $Y$, $Z$ its excenters.
Let $S$ be defined so that $O$ is the midpoint of $HS$, and let $T$ denote the midpoint of $SN$.
It is known that the orthocenter and the nine-point center of triangle $XYZ$ are $I$ and $O$,
respectively. Prove that

(a) the circumcenter of triangle $XYZ$ is $T$; and

(b) the centroid of triangle $XYZ$ is the centroid of $SIN$.


10764. Proposed by Ray Redheffer, University of California, Los Angeles, CA. Let $A = (a_{ij})$
be a real n-by-n matrix, and let $x$ and $y$ be real n-vectors satisfying $Ax = y$. Suppose that

$$\sum_{j \ne i} \max\{a_{ij}, 0\} < y_i < a_{ii} + \sum_{j \ne i} \min\{a_{ij}, 0\}$$

for all $i \in \{1, 2, \dots, n\}$. Show that $x_i > 0$ for all $i \in \{1, 2, \dots, n\}$.

10765. Proposed by Peter J. Ferraro, Roselle Park, NJ. Let $f_n$ be the nth Fibonacci number,
defined by $f_1 = f_2 = 1$ and $f_{n+2} = f_{n+1} + f_n$ for $n \ge 1$. Fix positive integers $k$ and $n$
with $n > 2k + 1$. Prove that $\lfloor \sqrt[k]{f_n} \rfloor - \lfloor \sqrt[k]{f_{n-k}} + \sqrt[k]{f_{n-2k}} \rfloor$ is 0 unless $f_n$ is a kth power,
when it is 1.

10766. Proposed by Szilard Andras, Babes-Bolyai University, Cluj-Napoca, Romania. Let
$x$, $y$, and $z$ be nonnegative real numbers. Prove that


(a) $(x + y + z)^{x+y+z}\, x^x y^y z^z \le (x + y)^{x+y} (y + z)^{y+z} (z + x)^{z+x}$,
(b) $(x + y + z)^{(x+y+z)^2}\, x^{x^2} y^{y^2} z^{z^2} \ge (x + y)^{(x+y)^2} (y + z)^{(y+z)^2} (z + x)^{(z+x)^2}$.


SOLUTIONS 


Cramer’s Rule for Non-Square Matrices 


10618 [1997, 768]. Proposed by S. Lakshminarayanan, S. L. Shah, and K. Nandakumar,
University of Alberta, Edmonton, Canada. Let $A$ be a real $m \times n$ matrix of full rank with
$m < n$ and let $b$ be a real $m \times 1$ matrix. For $1 \le i \le n$, define


$$x_i = \frac{\det(A^i A^T) - \det(A_i A_i^T)}{\det(A A^T)},$$

where $A^i$ is obtained by replacing the ith column of $A$ by $b$, and $A_i$ is obtained by deleting
the ith column of $A$. Show that $x = [x_1, \dots, x_n]^T$ is a solution to the linear system $Ax = b$.


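Solution by the GCHQ Problems Group, Cheltenham, U. K. We write $A^i(b)$ instead of $A^i$
to emphasize the role of the vector $b$; thus $A^i(0)$ indicates $A$ with its ith column zeroed out.
Observe that $A_i A_i^T = A^i(0) A^T$, by comparing corresponding entries.

Extend $A$ to a nonsingular $n \times n$ matrix $\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)$, where $C$ is an $(n - m) \times n$ matrix whose
rows form an orthonormal basis for the orthogonal complement of the row space of $A$. That
is, each row of $C$ has norm 1 and is orthogonal to all other rows of $\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)$. We have


$$\begin{pmatrix} A \\ C \end{pmatrix}\begin{pmatrix} A \\ C \end{pmatrix}^T = \begin{pmatrix} AA^T & 0 \\ 0 & I \end{pmatrix}, \qquad
\begin{pmatrix} A^i(b) \\ C \end{pmatrix}\begin{pmatrix} A \\ C \end{pmatrix}^T = \begin{pmatrix} A^i(b)A^T & M \\ 0 & I \end{pmatrix},$$

where $I$ is the $(n - m) \times (n - m)$ identity matrix and $M$ is some $m \times (n - m)$ matrix.
By substituting these computations into the definition of $x_i$, canceling the nonzero factor
$\det \bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)^T$, and using the linearity of the determinant in its ith column, we obtain


$$x_i = \frac{\det\Bigl(\bigl(\begin{smallmatrix} A^i(b) \\ C \end{smallmatrix}\bigr)\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)^T\Bigr) - \det\Bigl(\bigl(\begin{smallmatrix} A^i(0) \\ C \end{smallmatrix}\bigr)\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)^T\Bigr)}{\det\Bigl(\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)^T\Bigr)}
= \frac{\det\bigl(\begin{smallmatrix} A^i(b) \\ C \end{smallmatrix}\bigr) - \det\bigl(\begin{smallmatrix} A^i(0) \\ C \end{smallmatrix}\bigr)}{\det\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)}
= \frac{\det \bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)^i \bigl(\begin{smallmatrix} b \\ 0 \end{smallmatrix}\bigr)}{\det\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)}.$$

By Cramer’s rule, $x$ is the solution to the linear system $\bigl(\begin{smallmatrix} A \\ C \end{smallmatrix}\bigr)x = \bigl(\begin{smallmatrix} b \\ 0 \end{smallmatrix}\bigr)$, and hence $x$ is a solution
to $Ax = b$.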
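
The formula for $x_i$ can be checked on a small instance with exact rational arithmetic. The sketch below is only an illustration: the helpers `det`, `matmul`, `transpose`, and `generalized_cramer` are names introduced here, and matrices are plain lists of rows.

```python
from fractions import Fraction


def det(M):
    """Determinant by exact Gaussian elimination over the rationals."""
    M = [[Fraction(v) for v in row] for row in M]
    n, result = len(M), Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            M[col], M[pivot] = M[pivot], M[col]
            result = -result  # a row swap flips the sign
        result *= M[col][col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            M[r] = [a - factor * b for a, b in zip(M[r], M[col])]
    return result


def transpose(M):
    return [list(col) for col in zip(*M)]


def matmul(X, Y):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]


def generalized_cramer(A, b):
    """x_i = (det(A^i A^T) - det(A_i A_i^T)) / det(A A^T) for an m x n matrix A, m < n."""
    At = transpose(A)
    denom = det(matmul(A, At))
    x = []
    for i in range(len(A[0])):
        replaced = [row[:i] + [bi] + row[i + 1:] for row, bi in zip(A, b)]  # A^i
        deleted = [row[:i] + row[i + 1:] for row in A]                      # A_i
        num = det(matmul(replaced, At)) - det(matmul(deleted, transpose(deleted)))
        x.append(num / denom)
    return x
```

For example, with `A = [[1, 2, 3], [0, 1, 4]]` and `b = [5, 6]`, multiplying `A` by the returned vector should reproduce `b` exactly.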


Solved also by J. Fuelberth & A. Gunawardena, J. H. Lindsey II, M. Sharma & P. G. Poonacha (India), WMC Problems Group, 
and the proposers. 




An Identity for Strongly Connected Digraphs 


10620 [1997, 870]. Proposed by James Propp, Massachusetts Institute of Technology,
Cambridge, MA. A digraph on a vertex set $V$ is a subset $A \subseteq \{(v, w) : v, w \in V, v \ne w\}$
and is strongly connected if it is possible to get from any vertex $a$ to every other vertex $e$
by a finite succession of arcs $(a, b), (b, c), \dots, (d, e)$ in $A$. For $n \ge 1$, let $E_n$ (respectively,
$O_n$) denote the number of strongly connected digraphs on the vertex set $V = \{1, 2, \dots, n\}$
with an even (respectively odd) number of arcs. Show that $E_n - O_n = (n - 1)!$ for all
$n \ge 1$.


Solution I by the proposer, currently at University of Wisconsin, Madison, WI. The terminology
of the problem statement is somewhat nonstandard. In common usage, a digraph is
a pair $(V, A)$, in which $A$ is the set of arcs in the digraph. Edges from a vertex to itself,
and even multiple edges from one vertex to another, are ordinarily allowed; the digraphs
considered here with no such edges are usually called simple. The “succession of arcs”
described above is a path from $a$ to $e$.

Let the sign of a digraph with arc set $A$ be $(-1)^{|A|}$; the problem is to show that the sum
$s_n$ of signs of the strongly connected digraphs with vertex set $[n] = \{1, \dots, n\}$ is $(n - 1)!$.
Let $r_n$ be the sum of the signs of all the digraphs on the vertex set $[n]$ containing paths from
$n$ to all other vertices.

We prove first that $r_n = s_n - (n - 1)s_{n-1}$ for $n > 1$. For $J \subseteq [n - 1]$, let $R_J$ be the set
of digraphs on $[n]$ that contain paths from $n$ to all other vertices and such that $J$ is the set
of vertices other than $n$ that begin paths to vertex $n$. Let $r_J$ be the sum of the signs of the
digraphs in $R_J$, so $r_n = \sum_J r_J$.

When $|J| = n - 1$, $R_J$ is the set of strongly connected digraphs on $[n]$, and thus $r_J = s_n$.

When $|J| = n - 2$, let $j$ be the sole vertex of $[n - 1] - J$. The signs of digraphs in $R_J$
that have an arc from $J$ to $j$ sum to 0, since adding or deleting the arc $(n, j)$ changes the
sign without changing membership in this set. Thus we need to sum signs only for those
digraphs in $R_J$ that have no arc from $J$ to $j$. Such digraphs consist of a strongly connected
digraph on $[n] - \{j\}$ plus the arc $(n, j)$, so the sum of their signs is $-s_{n-1}$. With $n - 1$
choices for $j$, the sum of the contributions when $|J| = n - 2$ is $-(n - 1)s_{n-1}$.

When $|J| < n - 2$, let $i$ and $j$ be the largest and second-largest elements of $[n - 1] - J$.
Adding or deleting the arc $(i, j)$ changes the sign without changing membership in $R_J$.
Hence these contributions cancel, and $r_J = 0$.

We have proved that $r_n = s_n - (n - 1)s_{n-1}$. Since $s_1 = 1$, the proof is completed by
proving that $r_n = 0$ for $n > 1$. For $J \subseteq [n - 1]$, let $Q_J$ denote the set of digraphs on $[n]$ in
which $J$ is the set of vertices other than $n$ that are reachable by paths from vertex $n$. Let $q_J$
be the sum of the signs of the digraphs in $Q_J$, so $r_n = q_{[n-1]}$.

When $|J| < n - 1$, let $i$ be the largest element of $[n - 1] - J$. Adding or deleting the arc
$(i, n)$ changes the sign without changing membership in $Q_J$. Thus $q_J = 0$.

We conclude that $\sum_J q_J = r_n$. On the other hand, the sum of $q_J$ over all $J$ is the sum
of the signs of all digraphs on $[n]$. This sum is 0, since adding or deleting the arc $(1, 2)$
changes the sign. Thus $r_n = 0$, as desired.
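
For small $n$ the identity $E_n - O_n = (n-1)!$ can be confirmed by brute force. The sketch below is an illustration only: it labels vertices $0, \dots, n-1$ rather than $1, \dots, n$, enumerates every simple digraph, and sums the signs of the strongly connected ones. The function names are hypothetical.

```python
from itertools import product
from math import factorial


def is_strongly_connected(n, arcs):
    """True if every vertex is reachable from vertex 0 along arcs and along reversed arcs."""
    def reaches_all(adj):
        seen, stack = {0}, [0]
        while stack:
            v = stack.pop()
            for w in adj[v]:
                if w not in seen:
                    seen.add(w)
                    stack.append(w)
        return len(seen) == n

    forward = [[w for (v, w) in arcs if v == u] for u in range(n)]
    backward = [[v for (v, w) in arcs if w == u] for u in range(n)]
    return reaches_all(forward) and reaches_all(backward)


def signed_count(n):
    """Return E_n - O_n by enumerating all simple digraphs on n labelled vertices."""
    pairs = [(v, w) for v in range(n) for w in range(n) if v != w]
    total = 0
    for mask in product((0, 1), repeat=len(pairs)):
        arcs = [p for p, bit in zip(pairs, mask) if bit]
        if is_strongly_connected(n, arcs):
            total += (-1) ** len(arcs)
    return total
```

The enumeration visits $2^{n(n-1)}$ digraphs, so it is practical only for $n \le 4$ or so.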


Composite solution II by Nikhil Bansal, Bombay, India, and the editors. Let $e(D)$ denote
the number of arcs in a digraph $D$. The claim follows from comparing two expressions for
$g(x)$, the exponential generating function for $a_n = \sum (-1)^{e(D)}$, where the sum is over all
digraphs $D$ on $[n]$.

First, $g(x) = \sum_{n \ge 1} a_n x^n/n! = x$, since the number of digraphs with even size equals
the number with odd size when $n > 1$, while $a_1 = (-1)^0 = 1$.

Alternatively, we group the contributions according to certain subdigraphs, counting 
those of even size minus those of odd size. The strong components of a digraph are the 
maximal strongly connected subdigraphs. The condensation D* of a digraph D is the 
digraph obtained by contracting strong components to single points. The condensation of a 
digraph with k strong components is an acyclic digraph with k vertices. 

To form a digraph on $[n]$ having $k$ fixed strong components with vertex sets of sizes
$n_1, \dots, n_k$ respectively, we assign vertices to components, place a strongly connected digraph
on each component, form an acyclic digraph on $[k]$ as the condensation, and expand
edges of the condensation into sets of edges between the corresponding strong components.
We divide by $k!$, because strong components are distinguished by the names of their vertices,
but the names of the components are arbitrary.

The contributions to parity within each strong component and within the expansion of
each edge of the acyclic digraph are independent; the final digraph has even size if and only
if the number of odd contributions is even. Thus we take the product, over each set of edges
to be included, of the number of ways to include it with even size minus the number of
ways to include it with odd size. Let $b_m = E_m - O_m$; this is the contribution for a strong
component of order $m$.

At this point we have

$$a_n = \sum_{k=1}^{n} \frac{1}{k!} \sum_{n_1 + \cdots + n_k = n} \binom{n}{n_1, \dots, n_k} b_{n_1} \cdots b_{n_k} \sum_{C} \prod_{\{i,j\} \in \binom{[k]}{2}} (E_{ij} - O_{ij}),$$

where the inner sum is over acyclic digraphs $C$ on $[k]$, the notation $\binom{[k]}{2}$ stands for the set
of all 2-element subsets of $[k]$, and $E_{ij}$ and $O_{ij}$, respectively, denote the number of ways to
have an even or odd number of edges between components $i$ and $j$ in the expansion of $C$.

When $i$ and $j$ are not adjacent in $C$, there is no edge in the expansion, and $E_{ij} - O_{ij} = 1$.
When $i$ and $j$ are adjacent, there must be at least one edge in the expansion, and all such edges
agree in direction with the edge in $C$. Eliminating the empty set yields $E_{ij} - O_{ij} = -1$.
With the factor $-1$ for each edge of $C$, the product is $(-1)^{e(C)}$.

To further simplify the formula, we claim that $\sum_C (-1)^{e(C)} = (-1)^{k-1}$. We define an
involution on the acyclic digraphs that pairs up digraphs with sizes differing by 1, and we
show that the only unpaired digraph is the digraph $C_k$ with arc set $\{(k, j) : 1 \le j \le k - 1\}$.
A source is a vertex with no incoming arc; a predecessor of $j$ is a vertex $i$ such that $(i, j)$
is an arc.

Every acyclic digraph $C$ has at least one source. Let $i$ be the least source vertex. When
$i \ne k$, we add or delete the arc $(i, k)$; it remains true that $i$ is the least source. In the
remaining digraphs, the only source vertex is $k$. For these, let $j$ be the highest vertex having
a predecessor other than $k$. Add or delete the arc $(k, j)$. It remains true that $k$ is the only
source and that $j$ is the highest vertex with a predecessor other than $k$. The only digraph in
which $j$ does not exist is $C_k$ (with $k - 1$ edges), which completes the claim.
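
The claim that the signed count of acyclic digraphs on $[k]$ is $(-1)^{k-1}$ can likewise be checked by enumeration for small $k$. This is a sketch with made-up function names and vertices labelled $0, \dots, k-1$; it tests acyclicity by repeatedly stripping sources rather than by the involution in the text.

```python
from itertools import product


def is_acyclic(k, arcs):
    """Repeatedly remove vertices with no incoming arc from a remaining vertex."""
    remaining = set(range(k))
    arcs = set(arcs)
    while remaining:
        sources = [
            v for v in remaining
            if not any(w == v and u in remaining for (u, w) in arcs)
        ]
        if not sources:
            return False  # every remaining vertex has a predecessor, so a cycle exists
        remaining -= set(sources)
    return True


def signed_acyclic_sum(k):
    """Sum of (-1)^(number of arcs) over all acyclic digraphs on k labelled vertices."""
    pairs = [(u, w) for u in range(k) for w in range(k) if u != w]
    total = 0
    for mask in product((0, 1), repeat=len(pairs)):
        arcs = [p for p, bit in zip(pairs, mask) if bit]
        if is_acyclic(k, arcs):
            total += (-1) ** len(arcs)
    return total
```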

The coefficient of $x^n/n!$ in our generating function is now


$$\sum_{k=1}^{n} \frac{(-1)^{k-1}}{k!} \sum_{n_1 + \cdots + n_k = n} \binom{n}{n_1, \dots, n_k} \prod_{i=1}^{k} b_{n_i}.$$


Given the exponential generating function $f(x) = \sum b_n x^n/n!$, this yields $g(x) = 1 - e^{-f(x)}$.
Since $g(x) = x$, we obtain $f(x) = -\ln(1 - x)$, and hence $b_k = (k - 1)!$.
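
The step from the coefficient formula to $g(x) = 1 - e^{-f(x)}$ can be spelled out as follows. This is a sketch that assumes the inner sum carries the multinomial coefficient $\binom{n}{n_1, \dots, n_k}$, as the phrase "assign vertices to components" suggests.

```latex
\begin{aligned}
g(x) &= \sum_{n \ge 1} \frac{x^n}{n!} \sum_{k=1}^{n} \frac{(-1)^{k-1}}{k!}
        \sum_{n_1 + \cdots + n_k = n} \binom{n}{n_1, \dots, n_k} \prod_{i=1}^{k} b_{n_i} \\
     &= \sum_{k \ge 1} \frac{(-1)^{k-1}}{k!}
        \Bigl( \sum_{m \ge 1} \frac{b_m x^m}{m!} \Bigr)^{k}
      = \sum_{k \ge 1} \frac{(-1)^{k-1}}{k!} f(x)^k
      = 1 - e^{-f(x)} .
\end{aligned}
```

Setting $g(x) = x$ gives $e^{-f(x)} = 1 - x$, so $f(x) = -\ln(1 - x) = \sum_{k \ge 1} x^k/k$, and comparing coefficients of $x^k/k!$ gives $b_k = k!/k = (k - 1)!$.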


Editorial comment. The proposer’s proof is adapted from an analysis of the unsigned case 
due to V. A. Liskovec (On a recurrence method of counting graphs with labelled vertices, 
Soviet Math. Dokl. 10 (1969) 242-256) and explicated by E. M. Wright (The number of 
strong digraphs, Bull. London Math. Soc. 3 (1971) 348-350). Problem 6673 [1991, 965; 
1994, 686] in this MONTHLY is the analogous problem for undirected graphs. The proposer
notes the following consequence. If the sign of a disjoint union of strongly connected
(simple) digraphs is $(-1)^{e+k}$, where $e$ is the total number of arcs and $k$ is the number of
components, then the sum of the signs of all n-vertex disjoint unions of strongly connected 
digraphs is 0. He asks whether there is a simple direct proof of this corollary. One might 
also ask for a simple signed involution to prove the original claim directly. 


Solved also by R. J. Chapman (U. K.).


Simultaneous Squares from Arithmetic Progressions


10622 [1997, 870]. Proposed by M. N. Deshpande, Nagpur, India. Find infinitely many
triples $(a, b, c)$ of positive integers such that $a$, $b$, $c$ are in arithmetic progression and such
that $ab + 1$, $bc + 1$, and $ca + 1$ are perfect squares.


Solution I by Hansruedi Widmer, Nussbaumen, Switzerland. Let $a_0 = 1$, $a_1 = 4$, and
$a_{n+2} = 4a_{n+1} - a_n$ for $n \ge 0$, and set $b_n = 2a_{n+1}$ and $c_n = a_{n+2}$. We claim that
$(a_n, b_n, c_n)$ meets the conditions of the problem. By the recurrence, $c_n - b_n = b_n - a_n$,
and we have arithmetic progressions.

To prove that $a_n c_n + 1$ is a perfect square, we prove by induction on n that $a_n a_{n+2} + 1 = a_{n+1}^2$.
For $n = 0$, we have $1 \cdot 15 + 1 = 4^2$. For $n > 0$, we use the recurrence and the
induction hypothesis to compute

$$a_n a_{n+2} + 1 = 4a_n a_{n+1} - a_n^2 + 1 = 4a_n a_{n+1} - a_{n-1} a_{n+1} = a_{n+1}^2.$$

Next, we use this formula and the recurrence to compute

$$a_n b_n + 1 = 2a_n a_{n+1} + 1 = 2a_n a_{n+1} + a_{n+1}^2 - a_n a_{n+2} = a_{n+1}^2 - a_n(a_{n+2} - 2a_{n+1})
= a_{n+1}^2 - a_n(2a_{n+1} - a_n) = (a_{n+1} - a_n)^2.$$

Similarly $b_n c_n + 1 = (a_{n+2} - a_{n+1})^2$, so all desired properties hold.
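
As a quick sanity check on Solution I (not part of the published solution), the recurrence can be run directly. The function names below are illustrative choices, and the sketch only tests the first few triples rather than proving anything.

```python
from math import isqrt


def is_square(m):
    """Return True when the nonnegative integer m is a perfect square."""
    r = isqrt(m)
    return r * r == m


def widmer_triples(count):
    """Return (a_n, b_n, c_n) for n = 0..count-1, with a_{n+2} = 4a_{n+1} - a_n."""
    a, a_next = 1, 4  # a_0 and a_1
    triples = []
    for _ in range(count):
        a_next2 = 4 * a_next - a
        triples.append((a, 2 * a_next, a_next2))  # b_n = 2a_{n+1}, c_n = a_{n+2}
        a, a_next = a_next, a_next2
    return triples


def meets_conditions(a, b, c):
    """Arithmetic progression with ab+1, bc+1, ca+1 all perfect squares."""
    in_progression = (c - b == b - a)
    return in_progression and all(is_square(v) for v in (a * b + 1, b * c + 1, c * a + 1))
```

For instance, `widmer_triples(3)` starts with the triple (1, 8, 15), for which the three products plus one are 9, 121, and 16.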


Solution II by Zachary Franco, Butler University, Indianapolis, IN. In $\mathbb{Z}[\sqrt{3}]$, the norm
$\|r + s\sqrt{3}\| = r^2 - 3s^2$ is multiplicative and satisfies $\|2 + \sqrt{3}\| = 1$. Therefore,
$\|(2 + \sqrt{3})^n\| = 1$. For the expansion $(2 + \sqrt{3})^n = r + s\sqrt{3}$, we thus have

$$3s^2 = r^2 - 1. \qquad (*)$$

For $n > 1$, the triple $(a, b, c) = (2s - r, 2s, 2s + r)$ is in arithmetic progression and satisfies
$(2s - r)2s + 1 = (r - s)^2$, $(2s - r)(2s + r) + 1 = s^2$, and $2s(2s + r) + 1 = (r + s)^2$.
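
Solution II can be checked the same way with exact integer arithmetic. The sketch below represents $r + s\sqrt{3}$ as the pair (r, s), so multiplying by $2 + \sqrt{3}$ sends (r, s) to (2r + 3s, r + 2s); the function name is an illustrative choice.

```python
def franco_triples(count):
    """Return (2s - r, 2s, 2s + r) for (2 + sqrt 3)^n = r + s*sqrt 3, n = 2..count+1."""
    r, s = 2, 1  # n = 1, which would give a = 0 and is therefore skipped
    triples = []
    for _ in range(count):
        r, s = 2 * r + 3 * s, r + 2 * s  # multiply by 2 + sqrt 3
        assert r * r - 3 * s * s == 1    # the norm stays equal to 1
        triples.append((2 * s - r, 2 * s, 2 * s + r))
    return triples
```

The first few outputs agree with the triples produced by the recurrence in Solution I, as the editorial comment states for the whole family.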


Editorial comment. The two solutions generate the same family of triples. Jan Kristian 
Haugland, John P. Robertson, and Ivan Vidav independently proved that this family contains 
all triples with a < b < c that satisfy the conditions of the problem. Betsy Carper, Gary 
Hull, Lenny Jones, and Bonnie Wachhaus observed that for each triple in this family there 
is no fourth integer d such that a, b, c, d are in arithmetic progression and both bd + 1 and 
cd + 1 are perfect squares. Hence there is no quadruple of positive integers in arithmetic 
progression such that ij + 1 is a perfect square for all i, j in the quadruple.

David M. Bloom observed that this problem is related to problem 10238 [1992, 674; 
1995, 275]. The published solution for that problem used the Pell equation (*).
Solved also by I. Adler, M. Aissen, J. Anglesio (France), B. D. Beasley, J. C. Binz (Switzerland), D. M. Bloom, J. Bowring, 
J. T. Bruening, S. Byrd, R. J. Chapman (U. K.), J. Christopher, C. R. Diminnie, R. DiSario, H. Y. Far, J. K. Haugland (Norway), 
R. Heller, L. Jones et al., N. Komanda, J. Lee, N. F. Lindquist, C. Mack, L. G. Mans, A. S. Mittal, G. R. Mott, Y. Pan, M. Reekie, 
J. P. Robertson, H. Sedinger, N. C. Singer, W. R. Smythe, P. Trajovsky (Czech Republic), I. Vidav (Slovenia), M. Vowe (Switzerland),
X. Wang, C. H. Webster, P. Yiu, Anchorage Math Solutions Group, GCHQ Problems Group (U. K.), NSA Problems Group, NCCU
Problems Group, SAS Maths Club (India), WMC Problems Group, and the proposer. 


The Divisibility Poset Inside Itself 


10623 [1997, 870]. Proposed by Roy Barbara, Lebanese University, Fanar, Lebanon. Let
$P = \{1, 2, 3, \ldots\}$, and let $\mid$ be the usual divisibility relation on $P$. For any $S \subseteq P$ and
$n \in P$, let $S + n = \{s + n : s \in S\}$.

(a) Can one construct a subset S of P such that the poset $(S, \mid)$ is isomorphic to $(P, \mid)$,
$(S + 1, \mid)$ is isomorphic to $(P, \le)$, and $(S + 2, \mid)$ is isomorphic to $(P, \mid)$?

(b) For which integers n > 1 can one find a subset T of P such that (T, |), (T +n, |), and 
(P, |) are isomorphic posets? 


Solution to (a) by Nasha Komanda, Central Michigan University, Mt. Pleasant, MI. The
answer is yes. First we prove for positive integers $x, y$ and for integers $a \ne \pm 1$ that

$$\gcd(a^x - 1, a^y - 1) = |a^{\gcd(x,y)} - 1|. \qquad (*)$$

We may assume that $x > y$. From $a^x - 1 = a^{x-y}(a^y - 1) + (a^{x-y} - 1)$ we obtain
$\gcd(a^x - 1, a^y - 1) = \gcd(a^y - 1, a^{x-y} - 1)$. Also $\gcd(x, y) = \gcd(y, x - y)$. Thus (*)
follows by induction on $x + y$. If also x and y are odd, then

$$\gcd(a^x + 1, a^y + 1) = \gcd((-a)^x - 1, (-a)^y - 1) = |(-a)^{\gcd(x,y)} - 1| = |a^{\gcd(x,y)} + 1|.$$

For positive integers $a, x, y$ with $a > 1$, it follows that $(a^x - 1) \mid (a^y - 1)$ if and only if
$x \mid y$. When x and y are odd, we also have $(a^x + 1) \mid (a^y + 1)$ if and only if $x \mid y$.
Now let $S = \{2^{2^x - 1} - 1 : x \in P\}$. For $x, y \in P$,

$$(2^{2^x - 1} - 1) \mid (2^{2^y - 1} - 1) \iff (2^x - 1) \mid (2^y - 1) \iff x \mid y.$$

Thus $(S, \mid)$ is isomorphic to $(P, \mid)$. Also

$$2^{2^x - 1} \mid 2^{2^y - 1} \iff (2^x - 1) \le (2^y - 1) \iff x \le y.$$

Thus $(S + 1, \mid)$ is isomorphic to $(P, \le)$. Since $2^x - 1$ and $2^y - 1$ are odd, we have

$$(2^{2^x - 1} + 1) \mid (2^{2^y - 1} + 1) \iff (2^x - 1) \mid (2^y - 1) \iff x \mid y.$$

Thus $(S + 2, \mid)$ is isomorphic to $(P, \mid)$.
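
Both the lemma (*) and the three claimed isomorphisms lend themselves to a finite check. The following is a minimal sketch with illustrative function names; it tests only an initial segment of P, so it supports the argument without replacing it.

```python
from math import gcd


def lemma_holds(a, x, y):
    """Check gcd(a^x - 1, a^y - 1) = |a^gcd(x,y) - 1| for one choice of a, x, y."""
    return gcd(a**x - 1, a**y - 1) == abs(a ** gcd(x, y) - 1)


def s_element(x):
    """The element of S indexed by x, namely 2^(2^x - 1) - 1."""
    return 2 ** (2**x - 1) - 1


def check_part_a(limit):
    """Compare divisibility in S, S+1, S+2 with |, <=, | on the indices 1..limit."""
    for x in range(1, limit + 1):
        for y in range(1, limit + 1):
            sx, sy = s_element(x), s_element(y)
            if (sy % sx == 0) != (y % x == 0):
                return False
            if ((sy + 1) % (sx + 1) == 0) != (x <= y):
                return False
            if ((sy + 2) % (sx + 2) == 0) != (y % x == 0):
                return False
    return True
```

Python integers are unbounded, so indices up to 8 or so (numbers near $2^{255}$) are checked exactly.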


Solution to (b) by Robin J. Chapman, University of Exeter, Exeter, U. K. Such a T exists for
all $n \in P$. Let $O = \{1, 3, 5, \ldots\}$ be the set of odd positive integers. Let $p_j$ denote the j-th
smallest prime. Then $\phi : \prod_j p_j^{e_j} \mapsto \prod_j p_{j+1}^{e_j}$ is an isomorphism from $(P, \mid)$ to $(O, \mid)$.

Given $n \in P$, let $T = nO = \{nm : m \in O\}$. Then $T + n = 2nP = \{2nk : k \in P\}$.
The posets $(T, \mid)$ and $(O, \mid)$ are isomorphic, and $(T + n, \mid)$ and $(P, \mid)$ are isomorphic. We
obtain $(T, \mid)$ isomorphic to $(T + n, \mid)$ by transitivity.
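
The prime-shifting map in this solution is easy to compute for small arguments. The sketch below (helper names are illustrative, and trial division is used only for simplicity) checks on an initial segment that divisibility in T mirrors divisibility in T + n under the composite correspondence $n\phi(k) \leftrightarrow 2nk$.

```python
def is_prime(p):
    """Trial-division primality test, adequate for the small values used here."""
    if p < 2:
        return False
    d = 2
    while d * d <= p:
        if p % d == 0:
            return False
        d += 1
    return True


def next_prime(p):
    """Smallest prime strictly larger than p."""
    q = p + 1
    while not is_prime(q):
        q += 1
    return q


def shift_primes(k):
    """Replace each prime p_j in the factorization of k by p_{j+1}."""
    image, p = 1, 2
    while k > 1:
        while k % p == 0:
            k //= p
            image *= next_prime(p)
        p = next_prime(p)
    return image


def check_part_b(n, limit):
    """Check, for indices 1..limit, that n*phi(j) | n*phi(k) iff 2nj | 2nk."""
    odd_images = [shift_primes(k) for k in range(1, limit + 1)]
    if any(m % 2 == 0 for m in odd_images):
        return False  # the image of phi must lie in O
    for j in range(1, limit + 1):
        for k in range(1, limit + 1):
            divides_in_t = (n * odd_images[k - 1]) % (n * odd_images[j - 1]) == 0
            divides_in_t_plus_n = (2 * n * k) % (2 * n * j) == 0
            if divides_in_t != divides_in_t_plus_n:
                return False
    # T + n = 2nP: the odd multiple n*m shifts to 2n * (m + 1)/2.
    return all(n * m + n == 2 * n * ((m + 1) // 2) for m in range(1, 2 * limit, 2))
```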


Part (a) solved also by R. J. Chapman (U. K.). Both parts solved also by J. Dawson (Australia), GCHQ Problems Group (U. K.), 
and the proposer. 


An Equation Involving the Totient 


10626 [1997, 871]. Proposed by Florian Luca, Syracuse University, Syracuse, NY. For a
positive integer k, the number of positive integers less than k that are relatively prime to k
is denoted $\phi(k)$.

(a) Show that if m and n are relatively prime positive integers, then $\phi(5^m - 1) \ne 5^n - 1$.
(b)* Find all positive integers m, n such that $\phi(5^m - 1) = 5^n - 1$.


Solution to (a) by Nasha Komanda, Central Michigan University, Mt. Pleasant, MI. We use
the lemma proved in Problem 10623: $\gcd(a^m - 1, a^n - 1) = a^{\gcd(m,n)} - 1$ when $a, m, n$
are positive integers.

When $\gcd(m, n) = 1$, the lemma yields $\gcd(5^m - 1, 5^n - 1) = 4$. When $5^m - 1$
has the prime factorization $2^{e_0} \prod_{i=1}^{s} p_i^{e_i}$ with all exponents positive, we have
$\phi(5^m - 1) = 2^{e_0 - 1} \prod_{i=1}^{s} p_i^{e_i - 1}(p_i - 1)$. Suppose that $\phi(5^m - 1) = 5^n - 1$. Since $\gcd(5^m - 1, 5^n - 1) = 4$,
we have $e_i = 1$ for $1 \le i \le s$.

If m is even, then $5^m - 1 = 25^{m/2} - 1$ is divisible by 8. Therefore, $e_0 \ge 3$. Since m and
n are relatively prime, n is odd, so $5^n - 1 \equiv 4 \pmod 8$, which implies that $e_0 - 1 + s \le 2$.
Thus $e_0 = 3$ and $s = 0$, which yields the impossibility $5^m - 1 = 8$.

Thus m is odd, which yields $e_0 = 2$. Since $e_i = 1$ for $1 \le i \le s$, we have
$5^m - 1 = 4 \prod_{i=1}^{s} p_i$. Thus $5^m \equiv 1 \pmod{p_i}$ for $1 \le i \le s$. Since m is odd, 5 is a quadratic residue
modulo $p_i$. By the Quadratic Reciprocity Law, each $p_i$ is a quadratic residue modulo 5.
Thus $p_i \equiv \pm 1 \pmod 5$. If $p_i \equiv 1 \pmod 5$, then $p_i - 1 \equiv 0 \pmod 5$ and $5^n - 1 \equiv 0 \pmod 5$,
which is impossible. Therefore, $p_i \equiv -1 \pmod 5$ for $1 \le i \le s$. Our formula for $5^m - 1$
now yields $-1 \equiv (-1)^{s+1} \pmod 5$, and hence s is even.

On the other hand, $e_i = 1$ for $1 \le i \le s$ also yields $5^n - 1 = 2 \prod_{i=1}^{s} (p_i - 1)$. With
$p_i \equiv -1 \pmod 5$, this yields $-1 \equiv 2(-2)^s \pmod 5$. This requires $s \equiv 3 \pmod 4$, which
contradicts our conclusion that s is even. Thus m can be neither odd nor even, and no
solution exists.
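
Part (a) can also be confirmed by brute force for small exponents. The sketch below is only an illustration: the function names are hypothetical, the totient is computed by trial division, and the search bound is kept small so the factorizations stay cheap.

```python
from math import gcd


def totient(k):
    """Euler's totient of k >= 1, via trial-division factorization."""
    result, p = k, 2
    while p * p <= k:
        if k % p == 0:
            while k % p == 0:
                k //= p
            result -= result // p  # multiply result by (1 - 1/p)
        p += 1
    if k > 1:  # one prime factor larger than the square root may remain
        result -= result // k
    return result


def coprime_solutions(max_exponent):
    """List (m, n) with gcd(m, n) = 1, m <= max_exponent, and phi(5^m - 1) = 5^n - 1."""
    hits = []
    for m in range(1, max_exponent + 1):
        phi = totient(5**m - 1)
        # phi(k) < k for k > 1, so only exponents n <= m need to be examined.
        for n in range(1, m + 1):
            if gcd(m, n) == 1 and phi == 5**n - 1:
                hits.append((m, n))
    return hits
```

Consistent with the proof, the search returns an empty list; dropping the `gcd` condition turns it into a small experiment on the open part (b).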


Editorial comment. No solution was received to part (b). Roy Barbara provided partial
results about the general equation $\phi(a^m - 1) = a^n - 1$, with additional partial results in
the special case $a = 5$. When a is even, there is no solution except $(a, m, n) = (2, 1, 1)$.
When a is odd and at least 3, there is no solution when n is odd, when m is a power of 2, or
when m is divisible by at least as high a power of 2 as n.

When $a = 5$, further exclusions restrict the possibilities for $(m, n)$ to $\{(2^k p, 2^k q) : p$ is
odd, q is even, and $0 \le k \le 4\}$. Furthermore, such a solution requires the five equations
$\phi(5^{2^j p} - 1) = 5^{2^j q} - 1$ for $0 \le j \le 4$. This is highly restrictive; there may be no solution.


Part (a) solved also by H. Salle, GCHQ Problems Group, and the proposer.


Three Congruent Circles Between Two Triangles 


10659 [1998, 366]. Proposed by Jiro Fukuta, Shinsei-cho, Gifu-ken, Japan. Let D, E, F
be points in the interior of sides BC, CA, AB, respectively, of triangle $ABC$ such that
the incircles of $\triangle AEF$, $\triangle BFD$, and $\triangle CDE$ are congruent, each having radius r. Let
$\rho$, $s$, and $K$ be the inradius, semiperimeter, and area of $\triangle ABC$, and $\rho'$, $s'$, and $K'$ be the
corresponding quantities for $\triangle DEF$.

(a) Prove that $\rho' = \rho - r$, $s' = (1 - r/\rho)s$, and $K' = (1 - r/\rho)^2 K$.

(b) Prove that, if $r = \rho/2$, then D, E, and F are midpoints of the sides of $\triangle ABC$.


Solution of part (a) by GCHQ Problems Group, Cheltenham, U. K. Let the incentres of
$\triangle AEF$, $\triangle BFD$, and $\triangle CDE$ be $O_1$, $O_2$, and $O_3$ respectively. We first show that the
triangles $\triangle O_1O_2O_3$ and $\triangle DEF$ have the same area and same perimeter.

Let the incircle of $\triangle AEF$ touch $AF$ at $T$ and $EF$ at $U$. Let the incircle of $\triangle BDF$ touch
$BF$ at $V$ and $DF$ at $W$. Then $FT = FU$ and $FV = FW$, so $FU + FW = FT + FV =
TV = O_1O_2$. Continuing this process around $\triangle DEF$ shows that it has the same perimeter
as $\triangle O_1O_2O_3$.

Let $G$ be the foot of the perpendicular from $F$ to $O_1O_2$ and let $H$ be the intersection
of $O_1O_2$ and $EF$. Then $\mathrm{Area}(\triangle FGO_1) = \mathrm{Area}(\triangle FTO_1) = \mathrm{Area}(\triangle FUO_1)$, so
$\mathrm{Area}(\triangle FGH) = \mathrm{Area}(\triangle O_1UH)$. Continuing in this manner shows that $\triangle DEF$ has the
same area as $\triangle O_1O_2O_3$. The result is true whether or not line segments $FG$ and $O_2W$
intersect. Since the two triangles have the same area and the same perimeter, they must also
have the same inradius, namely $\rho'$.

Now $\triangle O_1O_2O_3$ is homothetic to $\triangle ABC$, with the incentre of $\triangle ABC$ as the centre of
homothety. It follows that $\rho' = \rho - r$. The scaling factor from $\triangle ABC$ to $\triangle O_1O_2O_3$ is
$\rho'/\rho$, so $s' = (\rho'/\rho)s = (1 - r/\rho)s$ and $K' = (\rho'/\rho)^2 K = (1 - r/\rho)^2 K$ as required.


Solution of part (b) by Calin Popescu, Université Catholique de Louvain, Louvain-la-Neuve,
Belgium. For brevity, let us call a triple of points D, E, F with the properties described in
the statement of the problem r-adequate. The midpoints $D_0$, $E_0$, $F_0$ of the sides BC, CA,
AB, respectively, form a $\rho/2$-adequate triple; we have to show that it is the only such triple.
To this end, let D, E, and F form a $\rho/2$-adequate triple, and note that $K'/K = 1/4$, by
part (a). Observe that $BD/BC$, $CE/CA$, and $AF/AB$ can be written as $1/2 + x$, $1/2 + y$,
and $1/2 + z$, respectively, with x, y, and z either all in $(-1/2, 0]$ or all in $[0, 1/2)$. Indeed,
if say $x < 0 < y$, then clearly the inradius of $\triangle CDE$ would be larger than the inradius
of $\triangle CD_0E_0$, which is impossible since both are $\rho/2$. Writing $K_A$, $K_B$, $K_C$ for the areas
of $\triangle AEF$, $\triangle BFD$, $\triangle CDE$, respectively, we obtain $K_A/K = (AE/AC)(AF/AB) =
(1/2 - y)(1/2 + z) = 1/4 + (z - y)/2 - yz$ and similarly $K_B/K = 1/4 + (x - z)/2 - zx$ and
$K_C/K = 1/4 + (y - x)/2 - xy$. It follows that $K'/K = 1 - (K_A/K + K_B/K + K_C/K) =
1/4 + xy + yz + zx$. Since x, y, and z are either all nonnegative or all nonpositive,
$K'/K = 1/4$ forces $x = y = z = 0$. Hence D, E, and F are indeed the midpoints of the
sides of the triangle ABC.
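
The area computation in part (b) can be written out term by term; the display below only expands the three ratios stated in the solution and adds them.

```latex
\begin{align*}
\frac{K_A}{K} + \frac{K_B}{K} + \frac{K_C}{K}
  &= \Bigl(\tfrac14 + \tfrac{z - y}{2} - yz\Bigr)
   + \Bigl(\tfrac14 + \tfrac{x - z}{2} - zx\Bigr)
   + \Bigl(\tfrac14 + \tfrac{y - x}{2} - xy\Bigr) \\
  &= \tfrac34 - (xy + yz + zx), \\
\frac{K'}{K} &= 1 - \Bigl(\tfrac34 - (xy + yz + zx)\Bigr) = \tfrac14 + xy + yz + zx.
\end{align*}
```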


Solved also by T. Hermann, N. Lakshmanan, G. Peng, C. R. Pranesachar (India), A. Sasane (The Netherlands), and the proposer. 
Part (a) solved also by R. Barbara (France), F. Bellot Rosado (Spain), J. Dou (Spain), P. E. Nuesch (Switzerland), C. Popescu 
(Belgium), W. Reyes (Chile), R. A. Simon (Chile), and I. Sofair. 
Quadrilateral Center of Gravity


10662 [1998, 464]. Proposed by Joseph D. E. Konhauser and Stan Wagon, Macalester 
College, St. Paul, MN. Find a construction for the center of gravity of the edges of a 
quadrilateral. 


Solution by the Con Amore Problems Group, Royal Danish School of Educational Studies,
Copenhagen, Denmark. If G is the center of gravity of the edges of the quadrilateral ABCD
then G is also the center of gravity of particles with masses proportional to the lengths of 
the edges AB, BC, CD, DA placed at the midpoints P, Q, R, S of these edges. Construct 
these midpoints. The center of gravity for the particles at P and R is the point T on PR 
such that PT : TR = CD: AB, and the center of gravity for the particles at Q and S is the 
point U on QS with QU : US = DA: BC, so we construct the points T and U. If T and 
U coincide, we have G = T = U. If not, then G is the center of gravity of particles with 
masses proportional to the lengths of the line segments AB + CD and BC + DA placed at 
T and U, respectively. The sum of two line segments is constructed by placing them end 
to end. Thus G is the point on TU with TG : GU = (BC + DA) : (AB+ CD), and we 
construct this point. 
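
The construction translates directly into coordinates. The following is a minimal numerical sketch, assuming vertices given as (x, y) pairs; the function names are illustrative, and the second function computes the same point as a length-weighted average of edge midpoints for comparison.

```python
from math import dist


def midpoint(p, q):
    """Midpoint of segment pq."""
    return ((p[0] + q[0]) / 2, (p[1] + q[1]) / 2)


def divide(p, q, m, n):
    """Point X on segment pq with pX : Xq = m : n."""
    return ((n * p[0] + m * q[0]) / (m + n), (n * p[1] + m * q[1]) / (m + n))


def edge_centroid_by_construction(A, B, C, D):
    """Follow the steps of the solution: midpoints, then T and U, then G."""
    ab, bc, cd, da = dist(A, B), dist(B, C), dist(C, D), dist(D, A)
    P, Q, R, S = midpoint(A, B), midpoint(B, C), midpoint(C, D), midpoint(D, A)
    T = divide(P, R, cd, ab)   # PT : TR = CD : AB
    U = divide(Q, S, da, bc)   # QU : US = DA : BC
    return divide(T, U, bc + da, ab + cd)  # TG : GU = (BC + DA) : (AB + CD)


def edge_centroid_direct(A, B, C, D):
    """Length-weighted average of the four edge midpoints."""
    edges = [(A, B), (B, C), (C, D), (D, A)]
    total = sum(dist(p, q) for p, q in edges)
    x = sum(dist(p, q) * midpoint(p, q)[0] for p, q in edges) / total
    y = sum(dist(p, q) * midpoint(p, q)[1] for p, q in edges) / total
    return (x, y)
```

When T and U coincide, `divide` simply returns that common point, matching the special case noted in the solution.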


Solved also by M. Benedicty, M. Boase (U. K.), G. D. Chakerian, R. J. Chapman (U. K.), S. S. Kim (Korea), J. H. Lindsey II, A.
Nijenhuis, V. Pambuccian, C. R. Pranesachar (India), A. Sasane (The Netherlands), J. Schaer (Canada), Anchorage Math Solutions 
Group, GCHQ Problems Group, and the proposers. 


Logarithmic Convexity of Stirling’s Ratio 


10680 [1998, 666]. Proposed by Harold G. Diamond, University of Illinois, Urbana, IL.
For $x > 0$ set $g(x) = x \log\bigl(\Gamma(x + 1)/(x^x e^{-x} \sqrt{2\pi x})\bigr)$. Show that g is concave down on
$(0, \infty)$.

Solution by Nathaniel Grossman, University of California, Los Angeles, CA. It is enough to
show that $g''(x) < 0$ when $x > 0$. We begin with Binet's second expression for the gamma
function, which we write in the form

$$\log \Gamma(x + 1) = \bigl(x + \tfrac12\bigr) \log x - x + \tfrac12 \log(2\pi) + 2k(x), \qquad (*)$$

where $k(x) = \int_0^\infty \arctan(t/x)(e^{2\pi t} - 1)^{-1}\,dt$ (M. Abramowitz and I. Stegun, Handbook of
Mathematical Functions, Dover, New York, 1972, p. 258 (6.1.50)). From (*) we find that
$g(x) = 2xk(x)$, hence $g''(x) = 2(xk''(x) + 2k'(x))$. Easily justified differentiation under
the integral sign leads to

$$xk''(x) + 2k'(x) = -\int_0^\infty \frac{2t^3}{(x^2 + t^2)^2 (e^{2\pi t} - 1)}\,dt,$$

in which the right hand side is clearly negative.
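
The concavity can be spot-checked numerically. The sketch below is not part of the published solution: it evaluates g through the standard-library `math.lgamma` and estimates $g''$ by a central second difference, with the step size `h` an arbitrary choice.

```python
from math import lgamma, log, pi


def g(x):
    """g(x) = x * log(Gamma(x+1) / (x^x e^{-x} sqrt(2 pi x))) for x > 0."""
    return x * (lgamma(x + 1) - x * log(x) + x - 0.5 * log(2 * pi * x))


def second_difference(f, x, h=1e-2):
    """Central second difference, an approximation to f''(x)."""
    return (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)
```

At the sample points 0.5, 1, 2, and 5 the second difference comes out negative, in line with the claim. For large x the values of $g''$ become very small, so a floating-point check of this kind is only informative on a moderate range.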


Solved also by J. Anglesio (France), P. Bracken (Canada), D. Bradley, E. Camouzis (Greece), R. J. Chapman (U. K.), R. A. Groeneveld,
D. Krug, O. P. Lossers (The Netherlands), R. Martin (U. K.), A. McD. Mercer (Canada), P. Simeonoy, A. Stadler (Switzerland),
and NCCU Problems Group.
REVIEWS


Edited by Harold P. Boas 
Mathematics Department, Texas A & M University, College Station, TX 77843-3368 


African Fractals: Modern Computing and Indigenous Design. By Ron Eglash. Rutgers 
University Press, 1999, ix + 258 pp., $25. 


Reviewed by James V. Rauff 


Ethnomathematics has enjoyed some fame and much notoriety in mathematical 
circles. Enthusiasts praise the recognition of mathematical ideas in non-western 
cultures, while critics bemoan the perceived lack of rigor and the tendency toward 
postmodernist excesses. Both camps are sure to cite Ron Eglash’s African Fractals 
as evidence for their arguments. 

Eglash has assembled a wide variety of ethnographic data, political-cultural 
commentary, and mathematics into an arresting tale of discovery and extrapola- 
tion. This book presents a roller-coaster ride through a broad range of academic 
disciplines, mostly not mathematical. To give the reader a feel for the experience 
of reading African Fractals, I will follow Eglash’s organization. 

The book begins with a short discussion of the nature of fractal geometry, the 
“five essential components” being recursion, scaling, self-similarity, infinity, and
fractal dimension. Eglash includes the standard illustrative examples common to 
books dealing with fractals and chaotic systems: the Cantor middle-third set and 
the Koch curve. He uses the five essential components as metrics to help deter- 
mine which of the wide range of African designs and knowledge systems are 
indeed instances of fractal geometry. Although the introductory chapter is not 
particularly compelling in itself, it nicely sets up the next chapter, the most 
appealing part of African Fractals. 

Aerial photography holds a magical power for many people, myself included, 
and Eglash wields his overhead views of African architecture like a wizard. Pairing 
the photographs with computer-generated drawings, Eglash argues persuasively 
that there is indeed fractal geometry at work in the design of African settlements. 
Seeing the pictures, no one can doubt that the palace of the chief in Logone-Birni 
and the Ba-ila settlement are recursively generated and possess self-similarity. 
African Fractals is worth the $25 price of admission if all the reader does is gaze at 
these photographs and the accompanying computer-generated fractal patterns. 

The architectural evidence for fractal geometry in some African cultures is 
convincing, but African Fractals goes on to suggest a deeper and more pervasive 
fractal character to African culture. First, however, Eglash must deal with two 
issues. 

Is fractal geometry the result of some universal human characteristic, or does it 
arise only in certain cultures? In Chapter Three, Eglash points out that we do not 
see fractals in the settlement architecture of either Native America or Europe.
This chapter is the weakest in the book, for it distracts the reader from the 
main flow of the discussion of African culture. Moreover, I lost some faith in
the narrative because of two glaring errors in this chapter, particularly trouble-
some after the outstanding images in the previous chapter. First, Eglash attrib- 
utes the construction of Teotihuacan in Mexico to the Maya (p. 42). While 
there is considerable evidence of Teotihuacano influence and trade with the 
Maya world, particularly at Copan, it is not generally accepted that Teotihuacan 
was a Maya city; for more information, visit the Teotihuacan website
http://archaeology.la.asu.edu/teo/ maintained by Arizona State University.
The second major factual error is Eglash’s attribution of the mathematical
analysis of Warlpiri sand drawings to Marcia Ascher in her book Ethnomathemat- 
ics [1]. While Ascher does discuss Warlpiri kinship systems, she has nothing to say 
about the “algorithmic properties” of their sand drawings. Apparently, Eglash has 
confused the Warlpiri, inhabitants of continental Australia, with the Malekula, 
whose sand drawings Ascher does analyze. The only publications about the 
mathematical properties of Warlpiri sand drawings that I am aware of are the two 
notes [5] and [7]. 

The second issue that Eglash must deal with before proceeding is a central one 
for ethnomathematics: what constitutes mathematics in culture, and what does 
not? Eglash does a very nice job of delineating and classifying the possibilities, and 
he is careful to apply these categories to his subsequent analysis of fractal designs 
in African culture. Fractal designs of a group of people might be understood in 
different ways, ranging from unintentional by-products of some other activity to 
intentional designs whose character may be either implicitly or explicitly realized. 
This continuum is central to a raging debate in ethnomathematics. Does the 
existence of mathematical structures in a culture in and of itself count as mathe- 
matics, or does the mathematical creation need to be intentional? Eglash positions 
himself firmly in the camp that requires intentionality. 

Specific examples of African designs that exhibit one or more of the “five 
essential components of fractal geometry” constitute the second part of African 
Fractals. Eglash also places the examples on the continuum of intentionality. The 
following list gives an idea of the breadth of Eglash’s work. 


Geometric algorithms. These include Mangbetu iterative squares sculptures, and 
Chokwe sand drawings that are intentionally designed as Eulerian paths. 


Scaling geometry. Examples include windscreen designs showing power-law scal- 
ing, kente cloth stretching, logarithmic scaling in Ghanaian designs, and adaptive 
scaling in hairstyles. The discussion of windscreen designs powerfully affirms the 
sophistication of traditional engineering and architecture. 


Numeric systems. Eglash shows that Bamana sand divination is similar to pseudo- 
random number generation using shift registers, and he finds self-organization in 
the board game Owari by viewing it as a one-dimensional cellular automaton. 


Recursion. African culture is permeated by recursive design and recursive knowl- 
edge systems. We see many examples drawn from religion, dance, kinship, sculp- 
ture, and weaving. Iteration and self-reference abound. After reading this 
jam-packed chapter, no one will deny the potency of recursion in African thought. 


Infinity. African knowledge systems use infinity “in the sense of a progression 
without limit” and represent it iconographically as a “completed whole.” I was
unsatisfied with this discussion of infinity, but, in fairness, I must say that to do the 
topic justice requires a book-length essay like Jadran Mimica’s splendid treatment 
of Iqwaye counting [4]. 
Complexity. In a rapid dash through some basic notions of cybernetics and an
idiosyncratic look at the Chomsky hierarchy of formal languages, Eglash argues for 
the existence of feedback loops and self-organization in African cultures arising 
from group intentionality. 


The following statement summarizes the second part of the book: “four of the
five basic concepts of fractal geometry—scaling, self-similarity, recursion and 
infinity—are all potent aspects of African mathematics,” but “a quantitative 
measure of dimension is completely absent.” Thus ends the more mathematical 
portion of African Fractals. One may quibble about some of the assertions (for 
instance, “mathematical complexity theory is based upon fractal geometry,” which 
confuses chaotic dynamical systems with the theory of computation), about some 
omissions (the largest single source for the study of Tusona [3] is not even cited), 
and about flawed editing (Archimedean is misspelled on p. 76 and the shift- 
register example in Figure 7.2 has missing and incorrect entries), but the con- 
tention that fractals exist in African cultural artifacts and are intentionally and 
algorithmically produced is undeniably established. 

Mathematicians are accustomed to books ending with suggestions for further 
research or applications, so they will be unprepared for the four chapters of 
humanistic “implications” that conclude African Fractals. Eglash wants us to 
consider what we should make of the presence of fractal geometry in African 
culture. He wants to get beyond the description of the fractal designs and 
knowledge systems and into the deeper cultural meanings. He eschews a particular 
epistemological framework and opts for a “toolbox” of approaches. One important 
tool is what he calls participant simulation, meaning the collaboration in mathe- 
matical analysis between ethnographer and informant. 

Subsequent pages treat a variety of issues that are of some concern to ethno- 
mathematicians, Africanists, and social theorists. In a discussion of the politics of 
recursion, Eglash states that “self-organization is not necessarily liberating; it can 
serve to support social control rather than resist it.” He also suggests that 
European colonists may have failed to recognize African cities as such because the 
cities were organized by fractal rather than Cartesian principles. Arguing for a 
relation between recursion and sexuality in culture, Eglash invokes Ada Lovelace 
and Alan Turing. The bottom line of these arguments, as put forward by Paul 
Ernest [2] and much earlier by Raymond Wilder [8], is that mathematical knowl- 
edge is a social construction. This is an important point for ethnomathematics. It 
must be the case that mathematical knowledge is dependent upon culture and that 
mathematical ideas may develop differently in different cultures before we may 
even consider that there is an ethnomathematics. 

Eglash’s assertions will appeal to those who are sympathetic to constructivist 
theories of mathematics, but will not convince readers unfamiliar with anthropo- 
logical and cultural argumentation. His tendency to assert what some other scholar 
was thinking (pp. 193, 202, and 213) may be off-putting to those unaccustomed to 
this kind of analysis. I must also admit to being a little skeptical of the suggestion
that Cantor got the idea of the middle-third set through his cousin Moritz from a 
design on an ancient Egyptian column (pp. 207-208). 

The book closes with some observations and speculations on the future for 
African fractals. Eglash observes that fractal design has become part of modern 
African architecture. Of special interest to mathematics educators is the discussion 
of how knowledge of the prevalence of fractal geometry in African culture should 
influence how mathematics is taught. Perhaps the most important result coming 
out of ethnomathematical research is that culturally informed mathematics materi-
als are successful; see the short survey and bibliography in [6]. 

African Fractals is not a perfect book, but it is a book that mathematicians 
should take a look at. It is the first book to treat a single mathematical notion from 
the point of view of African culture, and it raises several important questions about 
exactly what should be and can be considered mathematics. It is a good place to 
see what ethnomathematics is all about, for it represents the better side of 
ethnomathematical research. Even if the anthropology and the mathematical and 
cultural philosophy seem unappealing, you should pick up a copy of African 
Fractals if only to wonder at the aerial photographs of those fractal African 
settlements. 
TELEGRAPHIC REVIEWS


Edited by Arnold Ostebee 


with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges


Telegraphic Reviews are designed to alert readers in a timely manner to new books
appropriate to mathematics teaching and research. Special codes classify reviews by
subject area and appropriate use:

T : Textbook                 P : Professional Reading
C : Computer Software        L : Undergraduate Library
S : Supplementary Reading    13-18 : Grade Level
1-4 : Semester               ** : Special Emphasis
?? : Questionable

Readers are advised that price information is subject to change. Selected books
receive a second, more extensive review in the Monthly.


Books submitted for review should be sent to Book Reviews Editor, American Mathe- 
matical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield, MN 55057-1098. 


General, L. You Can’t Get Lost On a Golf 
Course. Francis Scheid. Jones River Pr 
(Plymouth, MA 02360), 1998, x + 158 pp, 
$16.95 (P). [ISBN 1-888964-03-0] Fictional 
dialogs between ‘student’ and ‘professor’ on 
golf with mathematical digressions. Golf 
course topology is mostly graph theoretic. SN 


General, P. Spectral Problems in Geometry 
and Arithmetic. Ed: Thomas Branson. Contemp.
Math., V. 237. AMS, 1999, xi + 174 pp,
$37 (P). [ISBN 0-8218-0940-7] Proceedings 
of a 1997 NSF-CBMS conference at the Uni- 
versity of Iowa. 


General, P. Proceedings of the St. Peters- 
burg Mathematical Society, Volume V. Ed: N.N. 
Uraltseva. AMS Transl., Ser. 2, V. 193. AMS, 
1999, vii+ 271 pp, $99. [ISBN 0-8218-1390-0] 


Education, P. Mathematics in the Middle. Ed: 
Larry Leutzinger. NCTM, 1998, vi + 218 pp, 
$26 (P). [ISBN 0-87353-460-3] 28 papers 
outline issues, present Standards-based curric- 
ula, and describe projects and activities for 
middle-grades classrooms. 


Logic, P. Truth, Proof and Infinity: A The- 
ory of Constructions and Constructive Reason- 
ing. Peter Fletcher. Synthese Library, V. 276. 
Kluwer Academic, 1998, ix + 469 pp, $157. 
[ISBN 0-7923-5262-9] Develops precise def- 
initions of construction and proof in construc- 
tive mathematics, and describes algorithmic un- 
derpinnings of intuitionist logic. For logicians, 
philosophers of mathematics, theoretical com- 
puter scientists. LB 


Logic, P. Advances in Contemporary Logic and 
Computer Science. Eds: Walter A. Carnielli,
Itala M.L. D’Ottaviano. Contemp. Math.,
V. 235. AMS, 1999, xi + 326 pp, $60 (P). 
[ISBN 0-8218-1364-1] Proceedings of the 
Eleventh Brazilian Conference on Mathemat- 
ical Logic held in 1996 in Salvador, Bahia, 
Brazil. 


Combinatorics, P, L. Graph Classes: A Sur- 
vey. Andreas Brandstadt, Van Bang Le, Jeremy 
P. Spinrad. Mono. on Disc. Math. & Applic. 
SIAM, 1999, xi + 304 pp, $68 (P). [ISBN 
0-89871-432-X] Survey of over 200 graph 
classes with emphasis on algorithmic point of 
view; results given without proof. A distillation 
of over a thousand cited references. SN 


Number Theory, P. Topics in Number The- 
ory: In Honor of B. Gordon and S. Chowla. 
Eds: Scott D. Ahlgren, George E. Andrews, Ken 
Ono. Math. & Its Applic., V. 467. Kluwer Aca- 
demic, 1999, viii + 258 pp, $120. [ISBN 0- 
7923-5583-0] Papers from a 1997 conference 
at the Pennsylvania State University. 


Number Theory, P.  Automorphic Forms, 
Automorphic Representations, and Arithmetic. 
Eds: Robert S. Doran, Ze-Li Dou, George T. 
Gilbert. Proc. of Symposia in Pure Math., V. 66. 
AMS, 1999, $121 set, [ISBN 0-8218-0659-9] 
set. Part 1, xi + 278 pp; Part 2, xi + 330 pp. 
Proceedings of the NSF-CBMS Regional Con- 
ference on Euler Products and Eisenstein Series 
held at Texas Christian University in 1996. 

Linear Algebra, T(16-17), P. Orthogonal Sets
and Polar Methods in Linear Algebra: Applications
to Matrix Calculations, Systems of Equations,
Inequalities, and Linear Programming.
Enrique Castillo, et al. Pure & Appl. Math. Wiley,
1999, xii + 422 pp, $89.95. [ISBN 0-471-32889-8]
Presents methods for solving a variety
of linear algebra problems based on properties
of orthogonal sets and dual cones. AO


Group Theory, T(18), P. Completely Reg- 
ular Semigroups. Mario Petrich, Norman R. 
Reilly. Canadian Math. Soc. Ser. of Mono. & 
Adv. Texts, V. 23. Wiley, 1999, x + 481 pp, 
$115. [ISBN 0-471-19571-5] 


Calculus, S(14), C. Multivariable Calculus 
and Mathematics: With Applications to Geom- 
etry and Physics. Kevin R. Coombes, Ronald 
L. Lipsman, Jonathan M. Rosenberg. Springer- 
Verlag, 1998, xiii + 283 pp, $34.95 (P), with 
disk. [ISBN 0-387-98360-0] Some chapters 
discuss standard material from an unusual point 
of view; others present topics that are difficult 
to address without a computer. Physical appli- 
cations (e.g., Kepler’s laws, electromagnetism, 
fluid flow) and elementary ideas from differen- 
tial geometry play a prominent role. Exercises 
exploit the graphical, symbolic, and numerical 
capabilities of Mathematica. Presumes famil- 
iarity with Mathematica basics. AO 


Calculus, T(13: 2), C. Calculus: The Lan- 
guage of Change, Second Edition. Keith D. 
Stroyan. Academic Pr, 1998, xix +555 pp, with 
CD-ROM. [ISBN 0-12-673030-X]; Projects for 
Calculus: The Language of Change, Second 
Edition, 1999, xii + 326 pp, (P). [ISBN 0- 
12-673031-8] First Edition was entitled Cal- 
culus Using Mathematica (TR, February 1994). 
Text now accommodates use of either Maple or 
Mathematica. Projects book contains 52 open- 
ended projects on applications in mathemat- 
ics, economics, science, and engineering. CD- 
ROM includes Mathematica Notebooks and 
Maple Worksheets as well as optional material 
not in the printed text. AO 


Calculus, T(13: 2). Calculus: Applications and 
Technology For Business, Social and Life Sci- 
ences, Second Edition. Edmond C. Tomastik. 
Saunders College, 1999, xxiv + 533 pp, $48 (P). 
[ISBN 0-03-021559-5] Based on the author’s 
text Applied Calculus With Technology. This 
edition adds material on modeling using regres- 
sion analysis, includes more referenced (i.e., 
“real life’) examples and exercises, and places 
greater emphasis on the use of technology. AO 


Complex Analysis, P. Convex and Starlike 
Mappings in Several Complex Variables. Sheng 
Gong. Math. & Its Applic., V. 435. Kluwer 
Academic, 1998, xiii + 199 pp, $89. [ISBN 
0-7923-4964-4] 


Partial Differential Equations, P. Global Solutions
of Nonlinear Schrödinger Equations. J.
Bourgain. Colloquium Pub., V. 46. AMS, 1999,
viii + 182 pp, $35. [ISBN 0-8218-1919-4]


Partial Differential Equations, T(17-18).
Linear and Nonlinear Waves. G.B. Whitham. 
Pure & Appl. Math. Wiley, 1999, xvi + 636 pp, 
$64.95 (P). [ISBN 0-471-35942-4] Paper- 
back edition of 1974 text (TR, February 1975). 


Partial Differential Equations, T(15-17). 
Numerical Solution of Partial Differential 
Equations in Science and Engineering. Leon 
Lapidus, George F. Pinder. Wiley, 1999, xiv 
+ 677 pp, (P). [ISBN 0-471-35944-0] Paper- 
back edition of 1982 text (TR, January 1983). 


Partial Differential Equations, T(17-18). 
Numerical Partial Differential Equations: Con- 
servation Laws and Elliptic Equations. J.W. 
Thomas. Texts in Appl. Math., V. 33. Springer- 
Verlag, 1999, xxii + 556 pp, $59.95. [ISBN 0- 
387-98346-5] From the Preface: “... includes 
stability results for difference schemes for solv- 
ing initial-boundary-value problems, analytic 
and numerical results for both scalar and sys- 
tems of conservation laws, numerical meth- 
ods for elliptic equations and an introduction 
to methods for irregularly shaped regions and 
for computation problems that need grid refine- 
ments.” 


Operator Theory, P. Singular Quadratic 
Forms in Perturbation Theory. Volodymyr 
Koshmanenko. Math. & Its Applic., V. 474. 
Kluwer Academic, 1999, viii + 308 pp, $165. 
[ISBN 0-7923-5625-X] 


Analysis, P. Continued Fractions: From Ana- 
lytic Number Theory to Constructive Approxi- 
mation. Eds: Bruce C. Berndt, Fritz Gesztesy. 
Contemp. Math., V. 236. AMS, 1999, xviii + 
379 pp, $78 (P). [ISBN 0-8218-1200-9] Pro- 
ceedings of a 1998 conference held at the Uni- 
versity of Missouri—Columbia. 


Topology, P. Low Dimensional Topology. Ed: 
Hanna Nencka. Contemp. Math., V. 233. AMS, 
1999, xi + 249 pp, $56 (P). [ISBN 0-8218- 
0884-2] Proceedings of a 1998 conference 
held in Funchal, Madeira, Portugal. 


Optimization, T(17), P. Network Optimiza- 
tion: Continuous and Discrete Models. Dimitri 
P. Bertsekas. Athena Scientific, 1998, xii + 
593 pp, $74. [ISBN 1-886529-02-7] Intro- 
ductory treatment of linear, nonlinear, and dis- 
crete network optimization problems. Theoret- 
ical results, algorithms, and applications. AO 


Optimization, T(17-18), P. Integer and Com- 
binatorial Optimization. George Nemhauser, 
Laurence Wolsey. Ser. in Disc. Math. & Optim.
Wiley, 1999, xiv + 763 pp, $74.95 (P).
[ISBN 0-471-35943-2] Paperback edition of
1988 text (TR, February 1989). 


Optimization, T(17-18), P. Iterative Meth- 
ods for Optimization. C.T. Kelley. Frontiers 
in Appl. Math. SIAM, 1999, xv + 180 pp, 
$37 (P). [ISBN 0-89871-433-8] Introduction 
to numerical methods for unconstrained and 
bound-constrained optimization problems. Fo- 
cuses on a few methods that convey essential 
ideas. Discusses deterministic sampling algo- 
rithms (e.g., Hooke-Jeeves, implicit filtering, 
MDS) as well as gradient-based techniques. AO 


Game Theory, P. Avances en Teoría de Juegos
con Aplicaciones Económicas y Sociales.
Eds: J.M. Bilbao, F.R. Fernández. Universidad
de Sevilla (Porvenir, 27-41013 Sevilla, Spain), 
1999, ix + 254 pp, (P). [ISBN 84-472-0517-7] 


Mathematical Modeling, T(15-17). Math- 
ematical Modeling, Second Edition. Mark 
M. Meerschaert. Academic Pr, 1999, xvi + 
351 pp. [ISBN 0-12-487652-8] New sections 
on discrete modeling (including integer pro- 
gramming) and on chaos and fractals. Three 
parts discuss optimization models, dynamical 
models, and probability models. (First Edition, 
TR, December 1993.) AO 


Optimal Control, P. Nonlinear Analysis, 
Differential Equations and Control. Eds:
F.H. Clarke, R.J. Stern. NATO Sci. Ser. C,
V. 528. Kluwer Academic, 1999, xix + 602 pp, 
$297. [ISBN 0-7923-5665-9] Proceedings of 
a 1998 NATO Advanced Study Institute held in 
Montréal, Canada. 


Optimal Control, T(16-17: 1, 2), S, P. Fuzzy
Control of Industrial Systems: Theory and Applications.
Ian S. Shaw. Kluwer Academic,
1998, xxiii + 192 pp, $115. [ISBN 0-7923-8249-8]
Includes background on fuzzy sets
and fuzzy logic, but assumes familiarity with 
linear continuous and discrete systems, feed- 
back, basic control theory. Covers fuzzy con- 
trollers, rule-based systems, stability analysis, 
neurofuzzy controllers, practical development 
issues, collaborative control systems (fuzzy 
controller plus human operator). No exercises 
in text, but questions over each chapter are avail- 
able on request from the author. LB 


Stochastic Processes, P. Advances in Stochas- 
tic Inequalities. Eds: Theodore P. Hill, Chris- 
tian Houdré. Contemp. Math., V. 234. AMS, 
1999, ix + 212 pp, $45 (P). [ISBN 0-8218- 
1086-3] Papers from a Special Session at the 
AMS meeting held at the Georgia Institute of 
Technology in 1997. 


Stochastic Processes, T(16-17), P. Stochastic
Processes. Emanuel Parzen. Classics in
Appl. Math., V. 24. SIAM, 1999, xix + 324 pp,
$37 (P). [ISBN 0-89871-441-9] Unabridged, 
corrected republication. Originally published 
in 1962 by Holden-Day. 


Stochastic Processes, P. Limit Theorems for 
Random Fields with Singular Spectrum. Niko- 
lai Leonenko. Math. & Its Applic., V. 465. 
Kluwer Academic, 1999, viii + 401 pp, $192. 
[ISBN 0-7923-5635-7] 


Elementary Statistics, T(13: 1), C. Statis- 
tics: An Introduction, Fifth Edition. Robert D. 
Mason, Douglas A. Lind, William G. Marchal. 
Duxbury Pr, 1998, xxii + 689 pp, $79.95, with 
disk. [ISBN 0-534-35379-7] Added in this 
edition: Excel spreadsheet examples, a chapter 
on quality control, and a section on box plots. 
(Fourth Edition, TR, November 1994.) KB


Elementary Statistics, T(14: 1). Applied 
Statistics and Probability for Engineers, Sec- 
ond Edition. Douglas C. Montgomery, George 
C. Runger. Wiley, 1999, xiv + 917 pp, $103.95. 
[ISBN 0-471-17027-5] Stresses methodology 
and applications rather than mathematical the- 
ory. Uses output from modern statistical soft- 
ware in examples. All text data are available in 
electronic form. HS 


Statistical Methods, T(17-18: 1), P, L. 
Model Selection and Inference: A Practical 
Information-Theoretic Approach. Kenneth P. 
Burnham, David R. Anderson.  Springer- 
Verlag, 1998, xx + 353 pp, $69.95. [ISBN 
0-387-98504-2] Introduction to analysis of 
empirical data using information-theoretic ap- 
proaches. Presents a consistent methodology 
that treats model formulation, model selection, 
estimation of model parameters, and their uncertainty
in a unified manner. Reviews other information
criteria (e.g., AICc, QAIC, TIC) and
presents several new approaches to estimating 
model selection uncertainty and incorporating 
this uncertainty into estimates of precision. Ex- 
amples (primarily ecological) illustrate techni- 
cal issues. KB 


Statistical Methods, S(15-17), P. Select- 
ing and Ordering Populations: A New Sta- 
tistical Methodology. Jean Dickinson Gib- 
bons, Ingram Olkin, Milton Sobel. Classics 
in Appl. Math., V. 26. SIAM, 1999, xxv + 
569 pp, $59.50 (P). [ISBN 0-89871-439-7] 
Unabridged, corrected republication. (1977 
Wiley edition, TR, April 1978.) 


Statistical Methods, T(18), P. Fractional Fac- 
torial Plans. Aloke Dey, Rahul Mukerjee. Ser. 
in Prob. & Stat. Wiley, 1999, xii + 211 pp, 
$84.95. [ISBN 0-471-29414-4] 


Statistics, S(18), P. Geostatistics: Modeling
Spatial Uncertainty. Jean-Paul Chilès, Pierre
Delfiner. Ser. in Prob. & Stat. Wiley, 1999, xi
+ 695 pp, $125. [ISBN 0-471-08315-1]


Theory of Computation, T(15-17: 1), S, P.
Term Rewriting and All That. Franz Baader, Tobias
Nipkow. Cambridge Univ Pr, 1998, xii +
301 pp, $49.95. [ISBN 0-521-45520-0] Discusses
a branch of theoretical computer science
based on equational logic which uses equations 
as directed replacement rules. Requires only 
basic discrete math background. Applications 
to algebra (word problem), software engineer- 
ing, recursion theory, functional and logic pro- 
gramming. LB 


Applications (Economics), P. Financial Mar- 
kets: Stochastic Analysis and the Pricing of 
Derivative Securities. A.V. Mel’nikov. Transl. 
of Math. Mono., V. 184. AMS, 1999, xiv + 
133 pp, $49. [ISBN 0-8218-1082-0] 


Applications (Social Science), S(16-17), P, L.
Evaluation and Optimization of Electoral Sys- 
tems. Pietro Grilli di Cortona, et al. Mono. 
on Disc. Math. & Applic. SIAM, 1999, xvi 
+ 230 pp, $53 (P). [ISBN 0-89871-422-2] 
Presents a quantitative approach to the study of 
electoral systems based on single- and multiple- 
criteria optimization methods. Also discusses 
automatic procedures for designing electoral 
districts. Includes reflections on the political as- 
pects of election reform that provide perspective 
on the benefits and the limits of using mathe- 
matical models to design electoral systems. AO 


Applications (Systems Theory), T(16-18), P.
Classical Control Using H∞ Methods: Theory,
Optimization, and Design. J. William Helton, 
Orlando Merino. SIAM, 1998, xvi + 292 pp, 
$45 (P). [ISBN 0-89871-419-2] Parts I-II 
provide an introduction to control systems de- 
sign; Parts III-V develop a theory for frequency 
domain CAD. Assumes reader has some back- 
ground in control theory (e.g., from a one- 
semester introductory course). 


Applications (Systems Theory), S(16-18).
Classical Control Using H∞ Methods: An Introduction
to Design. J. William Helton, Orlando
Merino. SIAM, 1998, xii + 171 pp,
$18.50 (P). [ISBN 0-89871-424-9] Contains
the design portions of the authors’ text Classical
Control Using H∞ Methods: Theory, Optimization,
and Design. (See preceding review.)

Applications (Systems Theory), T(18), P.
Indefinite-Quadratic Estimation and Control:
A Unified Approach to H² and H∞ Theories.
Babak Hassibi, Ali H. Sayed, Thomas Kailath.
Stud. in Appl. & Num. Math., V. 16. SIAM, 1999,
xvii + 555 pp, $80. [ISBN 0-89871-411-7]


Applications (Systems Theory), P. Lecture
Notes in Control and Information Sciences—
245: Robustness in Identification and Control. 
Eds: Andrea Garulli, Alberto Tesi, Antonio Vi- 
cino. Springer-Verlag, 1999, xiv + 467 pp, 
$105 (P). [ISBN 1-85233-179-8] Papers, 
many in a tutorial style, from a 1998 workshop 
held in Siena, Italy. 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
243: Progress in System and Robot Analy- 
sis and Control Design. Eds: S.G. Tzafestas, 
G. Schmidt. Springer-Verlag, 1999, xxvii + 
588 pp, $125 (P). [ISBN 1-85233-123-2] Pa- 
pers from the Third European Robotics, Intel- 
ligent Systems, and Control Conference (EU- 
RISCON ’98) held in Athens, Greece. 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
244: New Directions in Nonlinear Observer De- 
sign. Eds: H. Nijmeijer, T.I. Fossen. Springer- 
Verlag, 1999, xix + 525 pp, $119 (P). [ISBN 1- 
85233-134-8] Papers from a 1999 workshop 
held in Geiranger Fjord, Norway. 


Applications, T(16-17: 1), L. NURBS: From
Projective Geometry to Practical Use, Second 
Edition. Gerald E. Farin. AK Peters, 1999, xv + 
267 pp, $44. [ISBN 1-56881-084-9] NURBS 
(Non-Uniform Rational B-Splines) are widely 
used for describing objects in CAD/CAM and 
computer graphics applications. Text develops 
concepts and algorithms for describing curves 
and surfaces using NURBS. Second Edition in- 
corporates recent research results and a new 
chapter on Pythagorean curves (First Edition, 
TR, November 1995). AO 


Applications, P, L. Computation, Causation, 
and Discovery. Eds: Clark Glymour, Gregory 
F. Cooper. MIT Pr, 1999, xvi + 552 pp, $45 (P).
[ISBN 0-262-57124-2] A collection of papers
on data mining for causal relationships. (For 
example, to answer the question “Does an in- 
tervention that directly affects one part of a 
system indirectly alter another part?”) In five
parts: Causation, Representation and Predic- 
tion; Search; Controversy Over Search; Esti- 
mating Causal Effects; Scientific Applications. 


Applications, P. Modern Mathematical, Man- 
agement, & Statistical Sciences, I. Eds: Edward 
J. Dudewicz, Bruce L. Golden, Z. Govindara- 
julu. Amer. J. of Math. & Manag. Sci., V. 18, 
Nos. 3 & 4. American Sciences Pr, 1998, 
189 pp, $195 (P). [ISBN 0-935950-45-1] 


Reviewers 


KB: Karla Ballman, Macalester; LB: Lynne Baur, Car- 
leton; SN: Sam Northshield, Carleton; AO: Arnold Oste- 
bee, St. Olaf; HS: Heidi Shierholz, St. Olaf. 


THE OHIO STATE UNIVERSITY


The Department of Mathematics at Ohio State offers several options for graduate 
study: a broad Ph.D. degree program with specialization in nearly all branches of 
contemporary mathematics, pure and applied; an M.S. degree program with an 
elective focus in pure or applied mathematics; and a dual M.S. degree program with 
Computer Science.


Students are encouraged to begin their studies at Ohio State in June. Summer 
Fellowships providing stipends, in addition to a waiver of all tuition and fees, are 
anticipated. Academic-year financial support is available in the form of Teaching 
Associateships and Fellowships with stipends ranging, approximately, from 
$13,500 to $15,000, in addition to a waiver of all tuition and fees. Summer support 
in the form of TA, RA and Fellowships is also available for almost 90% of 
continuing students. 


Further information and application materials are available from: 


THE OHIO STATE UNIVERSITY 
Department of Mathematics 
231 W. 18th Avenue 
Columbus, Ohio 43210-1174 
(614)292-6274 


e-mail: bonace@math.ohio-state.edu
web site: http://www.math.ohio-state.edu
Leads students quickly to the key ideas of combinatorics in a logical and proactive way...


This book teaches the art of enumeration, or counting, by leading the reader
through a series of carefully chosen problems that are arranged strategically
to introduce concepts in a logical order and in a provocative way.


The format is unique in that it combines features of a traditional textbook 
with those of a problem book. It is organized in eight sections, the first four 
of which cover the basic combinatorial entities of strings, combinations, dis- 
tributions and partitions. The last four cover the special counting methods of 
inclusion and exclusion, recurrence relations, generating functions, and the 
method of Polya and Redfield that can be characterized as “counting modulo 
symmetry.” The subject matter is presented through a series of approximately 
250 problems with connecting text where appropriate, and is supplemented 
by approximately 220 additional problems for homework assignments. Many 
applications to probability are included throughout the book. 


While intended primarily for use as a text for a college-level course taken by
mathematics, computer science and engineering students, the book is suitable as
well for a general education course at a good liberal arts college, or for self-study.


Combinatorics: A Problem Oriented Approach

Daniel Marcus
Series: Classroom Resource Materials
List: $28.00 MAA Member: $22.50
Solutions manual available with adoption orders.
Calculus Mysteries
and Thrillers 


R. Grant Woods 


Series: Classroom Resource Materials 


Calculus projects you can give to your students... 


This book consists of eleven mathematics projects 
based on introductory single-variable calculus, 
together with some guidance on how to make use 
of them. Each project is presented as an amusing 
short story. In many of them a group of undergrad- 
uate mathematics students, formed into a consult- 
ing company called Math Iz Us, is hired to solve 
mathematical problems brought to them by clients. 
The problems solved include: helping to prosecute 
an accused pool shark, defending a driver accused 
of speeding, assisting a hockey coach in making his
star forward a more effective goal scorer, and advis- 
ing a pirate captain on how to divide a gold-plated 
goose-egg fairly among his crew. 


In each problem, the problem solvers are required 
to present to their client a detailed written report of 
their findings. Thus, students must produce and 
analyze accurate mathematical models of complex, 
verbally presented “real life” situations, and write a 
clear technical account of their solution. 


Instructors who are looking for problems that are 
novel, interesting, and several levels more complex 
than the typical textbook “word problem” will find
them in this book. It will be of particular value to
instructors who wish to combine training in appli- 
cations of calculus with training in technical writ- 
ing. The complexity of the problems makes them 
suitable for use as group projects. 


The calculus concepts on which the problems are 
based include: tangent and normal lines, optimiza- 
tion by use of critical points, inverse trig functions, 
volumes of solids, surface area integrals, and model- 
ing economic concepts using definite integrals. 
Although a few ideas from physics and economics 
are used in the problems, no prior knowledge of 
these fields is required. 
Packaged with a PC compatible disk that enhances the
material in the text. 


Suitable for classroom adoption in an innovative course for 
• a general education mathematics elective
• a mathematics or science major advanced elective
• an interdisciplinary course, even at a relatively
  elementary level
• a mathematical modeling course in a
  civil/environmental engineering program


This book has a dual objective: first, to introduce the reader 
to some of the most important and widespread environmen- 
tal issues of the day; and second, to illustrate the vital role 
played by mathematical models in investigating these issues. 
The environmental issues addressed include: ground-water 
contamination, air pollution, and hazardous material emer- 
gencies. These issues are presented in their full real-world 
context, not as scientific or mathematical abstractions; and 
for background, readers are invited to investigate their status 
in their own communities. 

The first part of the book leads the reader through rela- 
tively elementary modeling of these phenomena, including 
simple algebraic equations for ground water, slightly more 
complex algebraic equations (preferably implemented on a 
spreadsheet or other computerized framework) for air pollu- 
tion, and a fully computerized modeling package for haz- 
ardous materials incident analysis. The interplay between 
physical intuition and mathematical analysis is emphasized. 

For more advanced readers, the second part of the book 
returns to the same three subjects but with a higher level of 
mathematical sophistication (adjustable to the preparation of 
the reader by selection of subsections.) Many important clas- 
sical mathematical themes are developed through this con- 
text, examples coming from single and multivariable calcu- 
lus, differential equations, numerical analysis, linear algebra 
and probability. The material is presented in such a way as to 
minimize the required background and to encourage the 
subsequent study of some of these fields. 

Mathematical Modeling in the Environment

Charles Hadlock

Series: Classroom Resource Materials


An elementary course for a general audience could be
based entirely on Part 1, and a higher level mathematics,
science, or engineering course could move quickly to Part 2.

A PC compatible diskette packaged with the text con- 
tains a spreadsheet program that facilitates the numerical 
experimentation with the Gaussian plume equation introduced
in Chapter 3, as well as a public domain DOS program
(ARCHIE) for evaluating the consequences from various haz- 
ardous materials scenarios (e.g., the physical extent of flam- 
mable and toxic vapor clouds). Text is not tied to the use of 
this software, but it is included as an aid to meet the peda- 
gogical objectives of the text. 
Instructor’s and Solutions
Manual for Mathematical 
Modeling in the Environment 


Charles Hadlock 


Contains the complete solutions and further discussion of 
nearly every exercise presented in the textbook. This includes 
both the mathematical/computational exercises as well as the 
research questions and investigations. Readers will benefit 
greatly from perusing solutions to the problems whether they 
have worked them out themselves or not. Students using this
volume will still need to work out solutions of research questions
using their own sources and adapting them to their own
geographic locations, or using their own computational
schemes, so this volume could well be useful for students in
many course contexts. Enrichment material is included on 
the topics of some of the exercises. Advice for teachers who 
lack previous environmental experience, but who want to 
teach this material is also provided and makes it practical for 
such persons to offer a course based on these volumes. 
A Course in Mathematical Modeling


This book is intended as a text for a modeling 
course accessible to students who have mastered a 
one year course in calculus. It balances a variety of 
opposing modeling methodologies including theo- 
retical models versus empirical models, analytical 
models versus simulation, deterministic models 
versus stochastic models, and discrete models 
versus continuous models. Most of the examples 
are drawn from real-world data or from models that 
have been used in various applied fields. The use of 
computers in both simulation and in mathematical 
analysis is an integral part of the presentation. 


The authors emphasize the teaching of the model- 
ing process as opposed to merely presenting 
models. They begin their book with the simple dis- 
crete exponential growth model, and successively 
refine it to include variable growth rates, multiple 
variables, growth rates fitted to data, and the effects 
of random factors. The last part of the book moves 
into continuous-time models. Issues of model 
validity and purpose are emphasized throughout. 


Douglas Mooney and Randall Swift

A Course in Mathematical Modeling

Series: Classroom Resource Materials


Students taking a course based on this book should
have some mathematical maturity, but will need
little advanced knowledge. The book presents more
advanced topics on an as-needed basis and serves
to show how the different topics of undergraduate
mathematics can be used together to solve prob- 
lems. This perspective is valuable as either a road 
map for beginning students or as a capstone for 
more advanced students. The course presents 
elements of discrete dynamical systems, basic 
probability theory, differential equations, matrix 
algebra, stochastic processes, curve fitting, statistical 
testing, and regression analysis. Computer analysis 
is extensively used in conjunction with these topics. 


You can also use this book if you are seeking appli- 
cations to supplement a course in linear algebra, 
differential equations, difference equations, proba- 
bility theory, or statistics. 
Visit the authors’ website at <www.wku.edu/~swiftrj/Modeling/modeling.html>
to download data sets, Mathematica files, and other modeling resources
that execute the models described in the text.
CRYPTOLOGY


Albrecht Beutelspacher 


This fascinating little book is eminently readable, and it 
is a great deal of fun to peruse... the book is a real treat. 
We need more books like this, crafted by expert hands yet 
crafted so that the general reader can enjoy them. 
—Bulletin of The Institute of Combinatorics and 
Its Applications 


This excellent and entertaining book is suitable for a first 
course in cryptology for mathematical enthusiasts. An 
abundance of exercises and an excellent list of related ref- 
erences are included. 

—The Mathematics Teacher 


In spite of the light-hearted style in which the book is 
written throughout, it is a serious—and successful—
attempt to explain the basis of coding and decoding mes- 
sages...I can strongly recommend this book to anyone who 
wants a brief but comprehensive, eminently readable, and 
up-to-date introduction to this increasingly popular topic. 
— The Mathematical Gazette 


All of cryptology is covered in this work...Occupying a 
niche in the halls of the ivory tower of pure mathematics 
for nearly two millennia, number theory now forms a pil- 
lar of modern society. This book is the best explanation 
available today of how that pillar was constructed. 

— Charles Aschbacher 


A model to follow in order to make mathematics better 
known and understood. Accessible to a broad audience. 
Have fun reading this book, while you are getting a better 
understanding of cryptology. 

— Bulletin of the Belgian Mathematics Society 


How can messages be transmitted secretly? How
can one guarantee that the message arrives safely
in the right hands exactly as it was transmitted?
Cryptology—the art and science of “secret writ- 
ing”—provides ideal methods to solve these prob- 
lems of data security. 


The book is fun to read, and the author presents 
the material clearly and simply. Many exercises 
and references accompany each chapter. 
If you teach calculus, you should read this book. If you
want to know what mathematics your students understand, 
or if you want to know how to find out what they under- 
stand, this book contains essential information for you. 


It doesn’t matter whether you teach a reform or traditional 
course, whether you have large or small sections, or 
whether you use lectures or laboratories. The bottom line 
is the same: When all is said and done, what counts is 
what our students understand. And that’s what Student 
Assessment in Calculus is about.


Over the last ten years calculus instruction has changed 
in numerous ways. Whether they were trying on new 
ideas or following the more traditional routes towards 
conceptual understanding, both individual faculty and
departments needed to know if their instruction was
effective. To help deal with that issue, the National Science 
Foundation brought together a Working Group of experts 
in students’ mathematical thinking, in assessment, and in 
calculus reform. The goals of their work were to: 
• develop a framework to tailor calculus instruction to the
  students’ needs;
• establish an agenda for further research on student
  understanding;
• describe how to make use of a range of techniques to
  test what students know, such as multiple-choice tests or
  short essay questions, student portfolios and “clinical”
  interviews;
• summarize major goals of the reform movement and
  describe the challenges faced by those who are taking a
  closer look at how students learn;
• illustrate the ways in which calculus projects attempt
  (via exams, papers, projects, etc.) to find out what their
  students have learned.


Student Assessment in Calculus

A Report of the NSF Working Group on Assessment in Calculus

ALAN SCHOENFELD, EDITOR
Series: MAA Notes


This book is the result of those efforts. If you teach calcu- 
lus, if you want to see examples of useful assessment tech- 
niques, or if you are interested in issues of how to mea- 
sure student learning in mathematics, then there is a lot for 
you here. 
Robert Ash


A Primer of Abstract Mathematics prepares the reader
to cope with abstract mathematics, specifically abstract
algebra. It can serve as a text for prospective mathe- 
matics majors, as well as for those students taking or 
preparing to take a first course in abstract algebra, or 
those in applied fields who need experience in dealing 
with abstract mathematical ideas. 


Learning any area of abstract mathematics involves writ- 
ing formal proofs, but it is equally important to think 
intuitively about the subject and to express ideas clearly 
and cogently. The author aids intuition by keeping 
proofs short and as informal as possible, using concrete 
examples which illustrate all the features of the general 
case, and by giving heuristic arguments when a formal 
development would take too long. The text can serve as 
a model on how to write mathematics for an audience 
with limited experience in formalism and abstraction. 


Ash introduces several expository innovations in A 
Primer of Abstract Mathematics. He presents an entirely 
informal development of set theory that gives students 
the basic results that they will need in algebra. The 
chapter which presents the theory of linear operators 
introduces the Jordan Canonical Form right at the begin- 
ning, with a proof of existence at the end of the chapter. 
A Primer of Abstract Mathematics


Series: Classroom Resource Materials 


Contents: 
Logic and Foundations: Truth Tables, Quantifiers, Proofs, 
Sets, Functions, Relations. 


Counting: Fundamentals, The Binomial and Multinomial 
Theorems, The Principle of Inclusion and Exclusion, 
Counting Infinite Sets. 


Elementary Number Theory: The Euclidean Algorithm, 
Unique Factorization, Algebraic Structures, Further 
Properties of Congruence Modulo m, Linear Diophantine 
Equations and Simultaneous Congruences, Theorems of 
Euler and Fermat, The Mobius Inversion Formula. 


Some Highly Informal Set Theory: Well-Orderings, Zorn’s 
Lemma and the Axiom of Choice, Cardinal Numbers, 
Addition and Multiplication of Cardinals. 


Linear Algebra: Matrices, Determinants and Inverses, The 
Vector Space F^n, Linear Independence and Bases, Subspaces,
Linear Transformations, Inner Product Spaces, Eigenvalues 
and Eigenvectors. 


Theory of Linear Operators: Jordan Canonical Form, The 
Minimal and Characteristic Polynomials, The Adjoint of a 
Linear Operator, Normal Operators, The Existence of the 
Jordan Canonical Form. 


Appendix: An Application of Linear Algebra. 
AMERICAN MATHEMATICAL SOCIETY


These selections from the History of Mathematics Sources series and University Lecture Series 
enhance the classroom experience. Consider recommending these texts to students for supplementary
reading or independent study. For more AMS titles to broaden the learning experience, visit the 
University AMS Bookstore at www.ams.org/bookstore/. 


Supplementary Reading 

Riemannian Geometry During the Second Half of the Twentieth Century 
Marcel Berger, Institut des Hautes Études Scientifiques, Bures-sur-Yvette, France 

Masterfully written and delightful to read. In addition to the numerous digressions for 
newly introduced concepts, the author adds to the value of the survey by providing fertile 
opinions, some of them his, others those of his close colleagues and of M. Gromov in 
particular. The wonderful effort of the author is shown partially by the long bibliography 
of thirty pages, with references updated right to the very end of the century. A person who 
wants to learn more about Riemannian geometry will certainly do him/herself a good service 
by reading Berger's work. 
—Mathematical Reviews 

Reprint arranged with the approval of the publisher B. G. Teubner, Stuttgart and Leipzig. 

University Lecture Series; 1999; approximately 217 pages; Softcover; ISBN 0-8218-2052-4; 
List $34; All AMS members $27; Order code ULECT-BERGERMM911 

Supplementary Reading 

Non-Euclidean Geometry in the Theory of Automorphic Functions 
Jacques Hadamard, and Jeremy J. Gray, Open University, Milton Keynes, UK, and Abe Shenitzer, 
York University, Toronto, ON, Canada, Editors 

This is the English translation of a volume originally published only in Russian and now 
out of print. The book was written by Jacques Hadamard on the work of Poincaré. Hadamard 
begins with hyperbolic geometry, which he compares with plane and spherical geometry. In 
Chapter 2 he presents the appropriate automorphic functions, in particular, Fuchsian 
functions. Chapter 4 is devoted to the outlines of the more complicated Kleinian case. 
Chapter 5 discusses algebraic functions and linear algebraic differential equations, and 
the last chapter sketches the theory of Fuchsian groups and geodesics. 

This book is the second in an informal sequence of works called "History of Mathematics, 
Sources", to be included within the History of Mathematics series, co-published by the AMS 
and the London Mathematical Society. 

Members of the LMS may order directly from the AMS at the AMS member price. The LMS is 
registered with the Charity Commissioners. 

History of Mathematics, Volume 17; 1999; 95 pages; Softcover; ISBN 0-8218-2030-3; 
List $19; All AMS members $15; Order code HMATH/17MM911 


Lectures on Number Theory 
P. G. L. Dirichlet with supplements by R. Dedekind 

A new edition of Dirichlet's Lectures on Number Theory would be big news any day, but it's 
particularly gratifying to see the book appear as "the first of an informal sequence" which 
is to include "classical mathematical works that served as cornerstones for modern 
mathematical thought." So all power to the American Mathematical Society and the London 
Mathematical Society in their joint-venture History of Mathematics series: May the 
"Sources" subseries live long and prosper. [T]his is quite accessible, and could almost be 
used as a textbook still today. For those who like to heed Abel's admonition to "read the 
masters, not their students," here's a great opportunity to learn more about Number Theory. 
—MAA Online 

Co-published with the London Mathematical Society. 

Lectures on Hilbert Schemes of Points on Surfaces 
Hiraku Nakajima, Kyoto University, Japan 

The Hilbert scheme $X^{[n]}$ of a surface X describes collections of n (not necessarily 
distinct) points on X. More precisely, it is the moduli space for 0-dimensional subschemes 
of X of length n. Recently it was realized that Hilbert schemes originally studied in 
algebraic geometry are closely related to several branches of mathematics, such as 
singularities, symplectic geometry, representation theory, even theoretical physics. The 
discussion in the book reflects this feature of Hilbert schemes. 

For example, a construction of the representation of the infinite dimensional Heisenberg 
algebra (i.e., Fock space) is presented. This representation has been studied extensively 

Members of the LMS may order directly from the AMS at the —_ in the literature in connection with affine Lie algebras, con- 